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PREFACE. 


WITHIN the past twenty years a revolution has taken 
place in the mental attitude which the physicist main- 
tains towards the concepts which have been invented 
in the past in order to reach those broad generalisations 
which are the proud possession of his science. 

The demand on the part of the layman to know just 
what this revolution portends has been satisfied, as far 
as may be, by a liberal supply of popular works on 
Relativity. But the science undergraduate taking his 
normal courses in the University classroom, or reading 
his text-books of Physics and Mathematics, is anxious 
to ascertain in a more precise manner what changes 
this new idea is producing in the principles and content 
of physical science. It is primarily for him that this 
book has been written. Unfortunately it is only too 
easy to acquire the notion that the new knowledge is 
dealing out death and destruction to the principles won 
so laboriously since the time of Galileo and Newton. 
To correct such a disastrous misapprehension it is 
highly desirable that the mind should be gradually 
_ adapted to the new idea as the usual University courses 
are pursued, and not be compelled suddenly to readjust 
its point of view after the work involved in obtaining 
a degree has been completed. In short, Relativity 
must not remain something whose full significance 
can only be grasped by a small band of highly-trained 
specialists, but must be regarded as a selective principle 
with which the young student should make as early 
an acquaintance as possible. 

Since the undergraduate has been familiarised in 
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his school and first year courses with the elements of 
dynamical science, this book begins, after a general 
Introduction, with a discussion of the modifications in 
Kinematics and Dynamics required by the acceptance 
,of the Relativity standpoint. Historically, of course, 

the new theory grew out of the theory of the electro- 

(magnetic field; but as the treatment of the field by 
means of Maxwell’s equations is a branch of physical 
science which comes comparatively late in the usual 
courses, the relativity of the field equations is discussed 
after consideration of the equations of Dynamics.* In 
these earlier chapters, the mathematical method used 
is familiar and well within the student’s powers. 

In Chapter V. a systematic development of the 
Tensor Analysis, which has proved to be the suitable 
mathematical medium for the application of the Rela- 
tivity test, is initiated, and the remaining «chapters 
of Part I. deal with the discovery by its aid of those 
forms for the differential equations of dynamical and 
electromagnetic theory which are compatible with the 
restrictions imposed on the transformations of co- 
ordinates in Einstein’s first statement of the Relativity 
principle in 1905. The abandonment of these re- 
strictions, leading to the introduction of the principle 
of Equivalence and Einstein’s law of gravitation, and 
the requisite generalisation of the mathematical method 
are treated in Part II. Some care has been taken to 
indicate precisely the points of similarity between the 
Tensor Analysis used in Part I. and the Tensor Analysis 
for transformations with variable coefficients employed 
in Part II. Chapters X. and XI. are to some extent 
parallel treatments of the matters discussed in Chapters 
VI. and VII. with the restrictions of the earlier state- 
ment of the Principle removed. Chapter XII. is devoted 
to a discussion of the principle of Stationary Action, 
not only with regard to the important place which it 


* The appearance of Relativity as a definite subject in the curri- 
cula, may involve as a necessary change an earlier appearance than 
hitherto of Maxwell’s equations in Physics courses, a change not 
undesirable from other points of view. 
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attained in the Physics of the nineteenth century, but 
also in connection with the possibility of its maintenance 
of that position in the new synthesis. A chapter on the 
purely mathematical problem of solving Einstein’s 
gravitational equation with the bearings of the solutions 
on the questions of planetary orbits, deviation of light 
beams, gravitational displacement of spectral lines and 
gravitational waves, brings Part II. to a close. 
Doubtless the reading of these two parts will involve 
as much time and thought as even the most industrious 
student can spare from the demands of the other 
branches of the physical and mathematical sciences 
in his undergraduate years. But no book on Relativity 
published at present would be complete without some 
account of the very interesting developments which 
have taken place since 1917. Not only for the sake of 
post-graduate reading, but in order to appeal to a wider 
circle of readers, a fairly complete account is given in 
Part III. of the cosmological speculations of Einstein 
and de Sitter and the attempts by Weyl, Eddington 
and Einstein to derive a mathematical theory of the 
electromagnetic field (as well as of the gravitational) 
from the treatment of the metric field of space-time. 
In order to follow these researches some information 
concerning Riemann’s analysis of a manifold which 
can be represented in a multidimensional space is 
required. Thisissuppliedin Chapter XIV. In Chapter 
XV. the question of the finiteness of our universe and 
its bearing on the boundary conditions to be imposed 
on the differential equations of Physics is discussed. 
Chapter XVI. refers in some detail to the remarkable 
analogies pointed out by Eddington between the tensors 
which can be derived from the “curvature tensor ”’ 
of Weyl’s generalised geometry of a manifold and the 
tensors representing the important concepts of present- 
day physical science. It has just been found possible 
to incorporate in the last pages a summary of the most | 
recent contribution by Einstein, a paper which may | 
prove to be of cardinal importance in future develop- 


- ments. 
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The book is confined entirely to the physical and 
mathematical aspects of the theory, and makes no 
pretence to enter into any of the discussions which it 
has provoked in philosophical circles. The author has, 
of course, drawn on all the published sources of informa- 
tion known to him, but of the books published on the 
subject he is specially indebted to the early works of 
Dr. Silberstein and Dr. Cunningham on the Restricted 
Principle, to Professor Eddington’s “ Report ’’ and 
to his ‘‘ Space, Time and Gravitation,” one of the 
literary gems of English scientific literature. Reference 
has also been made to certain parts of Weyl’s “ Raum, 
Zeit, Materie,’”’ of which an English translation has 
recently appeared. The treatment of the relation of 
electron theory to the question of a finite universe, 
given in Chapter XV., is taken from ‘‘ Le Principe de 
Relativité,’’ by M. Jean Becquerel. 

The author wishes to thank Dr. E. L. Ince of the 
Mathematical Department in this University for his 
assistance in reading proof-sheets, and for many valu- 
able suggestions and criticisms. 


THE UNIVERS!TY OF LIVERPOOL, 
May, 1923. 
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INTRODUCTION 


THE reasons why the average man has found it to be a next 
to-impossible task to understand the Relativity principle and 
to reconcile its conclusions with ‘‘ common” sense are fairly 
obvious. The natural phenomena from which the principle 
takes its origin are not facts of “common” experience ;_ they 
are observed in certain intricate and difficult experiments, 
whose purpose and details are only known to a specially-edu- 
cated group of people ; and the natural language for exposition 
of the principle is the language of mathematical symbolism. 
Attempts to explain the Relativity theory without recourse to 
mathematics would appear to be as futile as attempts to write 
a scientific or philosophic treatise in the language of a child’s 
first reading primer. 

Relativity is not entirely a novel principle. In a limited 
sense it had its place in the Physical Science of the last two 
centuries ; but, owing to the fact that instruction in Mechanics 
concentrated on the skilful solution of knotty problems rather 
than on a full grasp of the principles of Dynamics, comparatively 
few of those attending courses in Physical Science were aware of 
the fact that the mathematical equations of motion as formu- 
lated by Newton are quite consistent with the assumption that 
in the treatment of purely mechanical occurrences the existence 
of an absolute space is a matter of no importance, so far as 
questions involving translatory motion as distinct from rotatory 
are concerned. (Rotation, it must be admitted, did offer some 
difficulty even to the limited Relativity of Newtonian dynamics, 
but not an entirely insurmountable one, as Mach was able to 
point out in his “ Principles of Mechanics.”) Let us elucidate 
this Newtonian Relativity by means of a fanciful but not in- 
conceivable experience. 

Suppose a laboratory were mounted on wheels and moved 
along a perfectly straight track with unchanging speed, without 
jar or jolt, then all mechanical experiments would work just 
as before. For example, a given pendulum would hang at rest 
along the same direction, and would oscillate, if set in motion, 
in the same period as before. Atwood’s machines, inclined 
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planes, levers, pulleys would all turn out results as usual; a 
body subject to definite pulls or pushes would move through 
the laboratory along the same path in the same time as when the 
laboratory was stationary. Indeed, if the laboratory had no 
windows, the experimenters, could they have originally been 
kept in ignorance of the initial starting, would be unable 
to tell whether the laboratory were stationary or not, provided 
of course that the movement remained quite uniform and 
rectilinear. Not a single mechanical experiment would by any 
alteration in behaviour or result reveal the motion along the 
track. In scientific language we should name the laboratory 
(or any group of material objects which we for convenience call 
“ fixed’) a “‘ frame of reference,’ because we naturally refer 
the changing positions of the experimental bodies to the appar- 
ently unchanging positions of its walls, floor, benches, etc. 
We can then say that if Newton’s laws of motion are true for 
experiments in one frame of reference they are also true for 
experiments in any other frame which is in uniform motion 
relative to the first. That is the scientists’ way of generalising 
the facts mentioned above, and it is of immediate interest 
because it brings home to us the very important thought that 
if Newton’s laws happen to be true for the movements of bodies 
as measured from a definite frame of reference, nevertheless 
this frame enjoys no special privilege. It is not necessarily 
an absolutely fixed frame on that account. The laws would 
be equally true in a multitude of other frames, and it is im- 
possible to suggest that Newton’s laws yield any test by which 
we could single out one frame, in distinction to all others, which 
is at rest in space in an absolute sense (assuming such absolute 
rest to be a perceivable property). 

This absence of any criterion of absolute rest is implicit in 
the Newtonian equations of motion ; for while the velocities of 
a body relative to two frames of reference (themselves in uniform 
relative motion to one another) are different, the accelerations 
are identical. Further, the forces on a body are in Newtonian 
theory the resultants of forces on its particles directed towards 
other particles of matter. Such component forces are functions 
of the distances separating the particles; and these distances, 
and therefore the forces, are absolute and independent of the 
frame of reference. So the proportionality of force to accelera- 
tion, or the invariability of mass, which is the essential feature 
of Newtonian Dynamics, is preserved in any frame of reference. 

Mechanics, albeit the oldest, is yet only one of the physical 
sciences. -The great advance made in the mathematical formu- 
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lation of optical and electromagnetic theory during the nine- 
teenth century revived the hope of discovering the earth’s 
absolute motion in space by means of the laws derived from 
this theory. One direct method which was suggested, con- 
sisted in an endeavour to measure differences in the speeds of 
light in different directions. It was argued that light would 
have a definite and unique velocity through space ; but that 
the velocity through terrestrial apparatus would be this velocity 
increased or decreased by an amount depending on the earth’s 
velocity through space and its relative direction to that of the 
source emitting the light, so that if the earth were moving 
towards the source the measured velocity would be equal to 
the real velocity of light plus that of the earth, while if the 
earth were moving away from the source the plus would be 
read as minus; just as two trains pass each other with a 
relative speed equal to the sum of their track speeds if they 
are travelling in opposite directions. This method seemed so 
hopeful that a great deal of patience and skill was expended 
on perfecting apparatus for this purpose, and in the end the 
American physicist Michelson and his pupils devised an instru- 
ment so ingenious and precise that failure to measure the 
anticipated differences was inconceivable—provided they ex- 
isted. Yet failure followed each successive attempt despite 
every improvement in apparatus and technique. No charge 
could be brought against the skill and precision of the experi- 
mentalist ; the only conclusion was that somewhere in the 
reasoning of the theoretician there was an erroneous assump- 
tion. 

Michelson’s experiment consisted in timing a race between 
two beams of light, each of which, starting from a common 
point A, travelled to reflecting mirrors and returned to A. Two 
mirrors, B and C, were set so that AB and AC were at right 
angles. Supposing the apparatus to be at rest in absolute 
space, the time of each journey would be 2/,/c and 2/,/c, where 
AB =1,, AC =/,, and the velocity of light in space is repre- 
sented by c. But the apparatus was, of course, situated on 
the earth, and so was in movement through space with a 
velocity, which during the time occupied by an experiment, 
could be considered as uniform. Represent this velocity by 
u. Then the expressions written above would no longer 
represent the times involved in the race, if the considerations 
advanced in the previous paragraph be valid. Suppose, for 
example, that AB were parallel to the direction of w and AC 
therefore perpendicular to u, it can be shown by elementary 
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kinematical reasoning that the beam travelling to B and back 
would take the time 2a2/,/c, and the other would take 2al,/c, 
where a is written as a convenient symbol for the expression 


(x utjct)-3 


and is, of course, always greater than unity. Hence the differ- 
ence in the times occupied by the two journeys would be 
2(a22, — al.)/c. Were AB, however, perpendicular to the direc- 
tion of wu and AC parallel to it, this difference would be 
2(al, — a?l,)/c. These results for the two particular settings 
of the apparatus will account for the expectation of the ex- 
perimenters that a gradual rotation of their apparatus should 
have revealed a change in the time by which one of the beams 
arrived at A ‘“‘ ahead” of the other. The optical method em- 
ployed to detect the anticipated change was certainly sensitive 
enough, even if the velocity of the earth through space were 
only of the order of its known relative velocity in its solar orbit. 
Yet not the slightest alteration in this difference could be de- 
tected despite experimental refinements and precautions which 
left no doubt that the absence of the expected effect was not 
due to lack of precision in the apparatus. Suggestions for 
evading the difficulty took the line of assuming either that the 
ether round the earth was not at rest, but was carried along 
with it, or else that a source of light in motion imparted its 
velocity to the ight and produced, when compounded with the 
“true” velocity of light, a resultant velocity whose magnitude 
depended on the velocity of the source and on the relative 
direction of the light beam and the motion of the source. 
Suffice it to say that neither of these suggestions survived 
the criticism of physicist and astronomer, and in a short time 
physicists came to accept a totally different suggestion first 
propounded by Fitzgerald, but worked out independently and 
much more completely by Lorentz. This amounted to an 
assumption that the dimensions of a body alter when it is in 
motion through the ether, as compared with its dimensions 
when. at rest ; that in fact a length of material equal to 7, when 
at rest, would contract to a length //a or U(r — w?/c?)} when 
moving with a velocity wu through the ether in a direction 
parallel to this length, c being the velocity of light through 
the ether ; across the direction of motion no alteration in size 
takes place. This contraction being independent of the 
material of the body and only depending on its velocity, it 
could not be detected by measurement with any material 


INTRODUCTION 5 


standard of length ; for when body and standard were juxta- 
posed for comparison, each would suffer exactly the same 
change, because each would have the same velocity through 
absolute space, and so coincidence of marks on the body with 
definite graduations on the standard would not be disturbed. 
In fact, the hypothesis of contraction stood by itself, unrelated 
to any other physical fact and merely accepted because it enabled 
physicists to maintain the hypothesis of the stagnant, immobile, 
and universal ether in face of Michelson’s negative result. How 
it achieved this is easily grasped by reference to the expressions 
above. Thus, when AB is parallel to w and AC perpendicular 
to it, the length of AB is not /,, but /,/a, while that of AC is 
still /,. Hence, the times for the two beams are 2a2/,/ac and 
2al,/c, which give as the difference 2a(/, — /.)/c, and in the other 
orientation the times are 2al,/c and 2a%/,/ac, yielding just the 
same difference, viz., 2a(l, — /,)/c. 

Despite the satisfaction derived from a hypothesis which so 
neatly explained the Michelson experiment, it still suffered 
from the defect mentioned above, viz., its lack of relation to 
any other physical change in the presumably contracted body. 
It is known, for instance, that ordinary mechanical strains in 
transparent, isotropic materials like glass would change their 
optical behaviour, endowing them with the property of double 
refraction, for example. The idea occurred to Rayleigh that 
although the contractions due to terrestrial motion through the 
ether would evade detection by direct measurement, yet they 
would render a body doubly refracting with its optic axis along 
the direction of motion, so that if the body were rotated the 
change in the orientation of the optic axis in the body itself 
could be detected. He instituted a search for this phenomenon, 
but in vain. A second research by Brace with more powerful 
experimental appliances proved equally futile. Other experi- 
ments, seeking with every hope of success to discover some 
physical alteration in the contracted body (e.g., a variation in 
the electrical resistance of a wire) as its orientation was changed, 
met with no better fate. Purely negative results were the 
invariable outcome of these attempts. Baffled in their search 
for this absolute motion of the earth, physicists were compelled 
to review the whole foundations of their science, and especially 
that part of it dealing with electremagnetic phenomena. It 
was just at this period that the theory of the electrical con- 
stitution of matter was being constructed and giving evidence 
of its great power; and it was at once recognised that this theory 
would have to be framed so as to account for the fact that all 
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the properties of a portion of matter were so nicely modified 
by any change in its motion through the ether as just to 
obliterate the effects which might naturally be expected to 
show themselves on account of this change of motion. On the 
one hand, it was discovered that the Lorentz-Fitzgerald con- 
traction had to be not merely a property of matter in bulk, 
but also an effect experienced by the most minute portions of 
matter, electrons and atomic nuclei. On the other, it was 
recognised that Maxwell’s equations of the electromagnetic field 
would have to be valid not only for axes of reference fixed in 
the ether, but also valid im the same mathematical form for axes 
moving uniformly through the ether. Now this point, which 
is vital to the true understanding of Relativity, is one which 
is extremely difficult for the non-mathematical reader to grasp. 
As we have just said, we are committed to an electrical theory 
of matter, and this theory is built on the validity of Maxwell’s 
equations. We postulate the existence of two physical vectors, 
an electric and a magnetic intensity, which have definite values 
at every point, and Maxwell’s equations indicate to those who 
are conversant with mathematical symbolism, the manner in 
which a knowledge of the values at one moment of these vectors 
at all the points surrounding, and in the immediate neighbour- 
hood of, a definite point can lead to the values of these vectors 
at this point at a later moment. The manner in which the 
value of either vector at a given point varies during a brief 
interval of time is quantitatively determined by the differences 
which exist at one instant between the value of the other 
vector at that point and its values at the points within a small 
region surrounding the given one. The equations are, in fact, 
a highly symbolic guide for predicting the history of the vectors. 
If these equations have what we may call their standard form 
in a system of axes fixed in the ether, it would not be unnatural 
to expect that if we use co-ordinate axes attached to a piece 
of matter in motion through the ether, the form of the equations 
suitable for such axes could not be the same as before, but 
would involve a symbol, “, which would represent the speed of 
the matter through the ether. But if this expectation is satis- 
fied, it is not difficult for the mathematician to show that it 
can be deduced from the equations that the velocity of light 
relative to the moving axes will depend on wu, and, further, that 
the experiments mentioned above should yield positive results. 
So, in order to agree with known facts, the equations for the 
moving frame of reference must not contain #, and they must 
have the same mathematical form as before. If the moving 
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frame is travelling parallel to a common axis of x of the two 
systems of axes, the co-ordinates of a definite point in the 
ether would be e.g. (x, y, 2), referred to the fixed system, while 
referred to the moving system they would at time #, be 


x — ut 
vi v= A ie 


Electromagnetic theory shows that the values of the electric 
and magnetic vectors at this point as observed by the fixed 
experimenter would differ from the values observed by the 
moving one; these can be represented by E and H for the 
former, and by E’ and H’ for the latter, and known mathe- 
matical relations are available for determining E’ and H’ in 
terms of E, H, and uw. It is not necessary at the moment to 
inquire what these equations should be, for it can be shown 
that if we retain equations (1), which are a direct expression 
of Newtonian kinematics, then it is impossible to derive from 
Maxwell’s equations in the symbols E, H, x, y, z, ¢ equations of 
the same mathematical form in the symbols E’, H’, x’, y’, 2’, t. 
But this is apparently what we must do to satisfy the experi- 
mental results. Lorentz was able to show that we could go 
some way towards the desired goal if instead of (I) we sub- 


stituted 
x’ = a(% — Ut) } (2) 
ey a : : , 
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a being, as before, a short symbol for (x1 — w?/c?)-#. This 
change after all was quite in keeping with the development of 
ideas since it was just the analytical change required to allow 
for the postulated contraction of all the standards of measure- 
ment used by the moving observers. Nevertheless, it was 
insufficient for the purpose in view. At last Lorentz was able 
to show that if we introduced another symbol ¢’, which was 
given by the equation 


= a(t — ux/c?) . : . * ¥(3) 


then from Maxwell’s equations in E, H, x, y, z and ¢ can be 
derived equations of exactly the same form in F’, H’, x’, y’, 2’ 
and ¢’. Thus the mathematical problem was solved, but there 
was one serious difficulty as to the relevance of the solution to 
the physical facts. Lorentz referred to ?’ as the “ local” time 
for an observer situated in the moving frame and occupying 
the position x, y, z in space at the “ true” time ¢; but he 
never suggested that it would in any sense be the time observed 
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by such an observer with any of his appliances. Any difficulty 
about accepting x’ as a real relative co-ordinate of position had 
never arisen in view of the Lorentz-Fitzgerald hypothesis of 
contraction, but there was no corresponding hypothesis con- 
cerning “‘ contraction” or “‘ expansion ” of time on a moving 
body to permit a rational being to accept #’ as an actual measure- 
ment by appliances in motion of an interval of time which 
would be measured as ¢ by precisely the same appliances at 
rest. It is to the genius of Einstein that we owe that amazing 
suggestion. But, of course, the enormous importance of his 
suggestion lay not in the mere enunciation of such a thesis, 
but in the clear and cogent reasons which he advanced in 
support of it. It has already been stated that the equations 
of optical and electromagnetic theory as formulated mathe- 
matically by the moving observer in terms of x’, y’, 2’, ’ turn 
out to be of the same form as those formulated by the fixed 
observer in terms of x, y, z, ¢. This is highly satisfactory in 
so far as it is consistent with the negative results of all experi- 
ments for detecting motion through the ether or for detecting 
hypothetical changes due to this motion; but it leaves us in 
the position of worrying ourselves about a purely irrelevant 
concept, viz., absolute rest. Thus, if we solve the equations 
just written for x, y, z,¢in terms of x’, y’, 2’, t’ we obtain, after 
two or three easy steps, 


% = a(x’ + ut’) 
YoY zz (4) 
= a(t’ + ux'|/c?) 
But these could have been obtained from the former set by 
merely interchanging the accented and unaccented letters and 
changing the sign of w. The plain meaning of this is, that if 
the observer in the moving frame chooses to assert that it is 
he who is absolutely fixed, and that it is the other observer 
who is in motion through the ether; that his time is the 
absolute and the other “local,” there is not a particle of 
evidence to settle his claim one way or another. Einstein in 
1905 pointed out this remarkable property of the Lorentz 
equations (not only those written above concerning co-ordinates 
and time, but still further equations of his theory relating to 
each other various physical magnitudes as measured in the 
two frames), viz., that all question of velocity in a postulated 
absolute space had faded away; that any difference in the 
measurement of physical magnitudes by two observers was 
determined entirely by their velative motion to each other, and 
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in no way depended on their assumed absolute velocities through 
space. In effect, whatever importance the philosopher may 
give to the concept of absolute space, it is a notion of no value 
to the physicist ; discussion of it is entirely irrelevant to his 
special work. Not only is this true for the concept of absolute 
space, it is also true for that of absolute time. ‘‘ Common 
sense ’’ shrinks from such a conclusion as paradoxical ; yet, as 
we hope to show very shortly, it is quite easy to deduce it if 
one is prepared to grasp and accept Einstein’s views on the 
measurement of time. 

As we have seen, equations (2) and (3), or their equivalents 
(4), were set up in the first instance in an attempt to solve a 
definite problem in electromagnetic theory ; yet, as Einstein 
pointed out, they are just the equations we shall arrive at if 
we abandon all considerations of an absolute space, or of any 
special theory of matter and electromagnetic phenomena and 
simply consider the reactions on pure kinematical reasoning of 
the underlying fact that light has the same velocity relative to 
all observers, whatever be their own velocities relative to each 
other. 

A person sitting in a street car will say that he is in the same 
place during his journey if he is only thinking of the car, but 
not so if he is thinking of the street. Likewise, if he is stand- 
ing in the street, he will say he remains in the same place if 
he is thinking of the earth, but not if he is thinking of the 
solar system ; and there is no ascertained limit to such a sequence 
of statements. In fact, the phrases, “in the same place,” 
“in different places” have no exact physical meaning except 
in relation to some specified frame of reference. An object 
remaining in the same place in one frame is changing place in 
another. It is one of the most notable features of Einstein’s 
work that he has pointed out a similar lack of precision in the 
phrase, ‘‘ at the same time,” when used without consideration 
of a specified frame of reference. Over the earth we keep 
time by mechanical appliances, such as clocks, in which some 
periodic occurrence is definitely related to one rotation of the 
earth relative to the stars. However, these appliances require 
frequent adjustment, and in the last resort, if the order of ac- 
curacy desirable for time measurement required it, adjustment 
would have to be made by means of-light signals, and this 
method would have to be adopted in any case if observers were 
inaccessible to one another. One observer at a pre-arranged 
reading of his time-piece would despatch a flash of light which 
‘would reach another observer. The latter, after allowing for the 
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time required for light to travel the distance, would set his clock 
accordingly. It is in the italicised words that the essence of 
Einstein’s method of time measurement lies. For example, if 
the two observers be at relative rest to one another, the second 
observer divides their ascertained and permanent distance 
apart by the velocity of light, thus finds the time which the 
light takes and allows for that amount. If, however, the 
observers be in relative motion, the second observer realises 
that in his frame of reference the first observer (the despatcher 
of the signal) is moving. He must ascertain what place in his 
frame the sender occupied at the instant the signal was de- 
spatched, and divide the distance from the place to himself by 
the speed of light in order to make allowance. But what 
speed ? According to earlier views, it would have been a 
modified speed obtained by combining in a well-known way 
the (assumed) absolute speed of light in space with his own 
speed in space. Were that the case, no trouble would have 
arisen; but, as Einstein postulated on the strength of the 
experimental evidence, that cannot be; he must divide by the 
same speed of light as before—always the one and only speed 
as ascertained by all observers, no matter what the assumed 
absolute motion may be. The effect of insisting on this can 
be seen from a simple example. Suppose we have two observers 
at rest in one frame of reference, say at some distance from 
each other on a long straight road. From a lamp fixed in the 
road and midway between them a flash of light is despatched 
outward by uncovering a shade for as brief a time as possible. 
The arrival of the light at the one observer’s station will be 
simultaneous with the arrival at the other’s—simultaneous, 
that is, for them—because they will say that the lamp was 
equidistant from them when the light was despatched, and 
light travels at the same speed in all directions. But now 
consider another frame of reference, a long lorry, for instance, 
moving along the road, so long that a man at one end of it 
would just be passing one road observer as the light reached 
him, and another man at the other end of the lorry would be 
passing the second road observer as the light reached him. 
For clearness, suppose the road lies east and west and 
the lorry is travelling east, and let us speak of the east and 
west roadman and the east and west lorryman. The two 
lorrymen would admit that as the light takes time to travel, 
they must have individually been a little westward of their 
respective roadmen when the light was despatched, since they 
were abreast when it arrived. That being so, the west lorryman 
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was then a little further away from the point of despatch than 
the east lorryman. In other words, the lamp was not midway 
between them when the light signal started. Consequently, 
the arrival of the light at one lorryman’s place is not simul- 
taneous with the arrival at the other, because the velocity of 
light is the same eastward or westward (or in any direction) for 
all people on the lorry. On the older view, the east lorryman 
receding from the lamp would have said that the light is advanc- 
ing to him more slowly than to the west lorryman who is moving 
towards the lamp, and so the shorter distance would be com- 
pensated by a smaller velocity of light and the longer distance 
by a larger velocity. But in the face of the experimental 
evidence this hypothesis concerning variation in the velocity 
of light is untenable, and so the conclusion is quite valid that, 
whereas the roadmen agree that the light reached the momen- 
tarily common position of the east roadman and lorryman and 
that of the west roadman and lorryman “ at the same time,” 
the lorrymen will agree that it reached the momentarily 
common position of the east lorryman and roadman “at an 
earlier time ’”’ than that of the west lorryman and roadman. 

Many will feel that the rather impracticable nature of the 
illustration is detrimental to the conclusion. Not so--it is 
merely an attempt to sketch in words the vital feature of the 
mathematical analysis employed by Einstein. Further, some 
may say that the minute times involved in the transference of 
light would be quite unmeasurable by any of our appliances. 
That is not a valid objection. The physicist’s aim is to lay 
down principles and develop analysis thereon which will survive 
any practical test, however severe and precise. Whatever 
experimental evidence is available (and some of it is extremely 
searching) justifies Einstein’s postulates. It is possible, per- 
haps, to raise philosophical objections to Einstein’s method of 
relating time by light signals, but any physicist who accepts it 
as a basis for physical time must accept Einstein’s conclusions. 
There is no evasion of the logic possible. 

When anyone has grasped the inner meaning of this 
“paradox,” i.e., realised that it is not a paradox, he will find 
that two other well-known “ paradoxes”’ of Einstein’s treat- 
ment of space and time can also be revealed to him. We will 
adopt the same fanciful frames as before. A flash of light is 
emitted from the lamp, and a little later another flash is emitted, 
and the interval between the flashes is adjusted so that the 
departure of the second signal from the lamp is simultaneous 
‘ with the arrival of the first flash at a place on the road lying 
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to the east of the lamp. If the reader has really grasped 
‘paradox’ number one, he will at once ask, ‘To whom are 
these events simultaneous ?”’ Let us suppose they are simul- 
taneous to observers on the road, i.e., if a man stationed on 
the road at the place determines the interval between the 
arrival of the first and the second flash, he finds it to be equal 
to the time required for light to travel the distance between 
the lamp and his position. But now consider two men on the 
lorry so situated that one passes the lamp as the second flash 
leaves it, and the other just passes the place on the road as 
the first flash reaches it, the second lorryman being naturally 
to east of the first. From what we have just reasoned out, 
the arrival of the first flash at the place and the despatch of 
the second flash from the lamp are not simultaneous events 
to the lorrymen; for them the former event occurs earlier 
than the latter. That is, they put the despatch of the second 
flash from the lamp further on in time from the despatch of 
the first flash than the roadmen do, or they find a greater 
interval of time between the flashes than the roadmen. Of 
course, it must be noted that these two events, viz., the emis- 
sions of the two flashes occur at the same place for the roadmen, 
but not for the lorrymen, who see the lamp moving westward 
between the flashes. We can put this conclusion into formal 
language as follows: Two events occur at the same place in a 
definite frame of reference, but at different times, and the 
interval of time between them is measured by observers in this 
frame. With reference to observers in another frame moving 
relative to the first, these events occur at different places ; 
moreover, the interval of time between the events as measured 
by the observers in the latter frame is greater than that measured 
by those in the former. 

The third ‘‘ paradox” concerns the measurement of the 
distance between two marks on the road by roadmen and 
lorrymen respectively. Call the marks fixed on the road A and 
B respectively, B lying to the east of A. Road observers at 
A and B agree that at a prearranged time on their clocks, they 
will make two chalk marks on the lorry as it passes them. 
These marked places we call C and D, and so the passage of C 
past A and D past B are simultaneous events—éo the roadmen, 
and they will say that C to D is the same distance as A to B. 
What will the lorrymen say? Why, that D passed B earlier 
than C passed A (because, according to the first of our three 
propositions, events which are simultaneous to people on the 
toad are not simultaneous to people on the lorry), i.e., that 
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some mark E on the lorry to the west of D passed B at the same 
time as C passed A. -Hence they will return the distance AB 
as equal to CE, which is shorter than CD. So we add another 
“paradox ”’ to our formal statements. Two events occur at 
the same time in a definite frame of reference but at different 
places, and the distance between the places is measured by 
observers in this frame. With reference to observers in another 
frame moving relative to the first, these events occur at different 
times ; moreover, the distance between the places as measured 
by observers in the latter frame is less than that measured by 
those in the former. 

Anyone who can grasp the standpoint from which these 
three propositions are developed is in a position to realise the 
difference between it and that of the earlier work of Lorentz 
and Larmor. For those who maintain the existence of the 
universal ether as an absolute frame of reference, a body at 
rest in the ether has certain linear dimensions which are the 
same for all observers, whether they are at rest in the ether or 
not. If the body is moving through the ether, its dimensions 
at right angles to the motion are unchanged, its dimension 
parallel to the motion is shortened ; but this shortening is the 
same to all observers whether at rest in the ether or not, i.e., all 
observers obtain the same shortened length. The shortening 
is a matter dependent solely on the absolute movement of the 
body through the ether, and has nothing to do with the relative 
motion of the observer to the body. It is an absolute property 
of the body. For the relativist, however, motion through an 
ether is irrelevant. The dimensions of a body are different for 
different observers. For an observer at rest relative to the 
body, they have certain values, for an observer moving relative 
to the body the dimension parallel to the relative velocity is 
smaller, and the shortening is more marked the greater the 
relative velocity. In fact, it can be demonstrated by appli- 
cation of mathematical reasoning to the case considered above 
that a length which for a relatively stationary observer is /, is 
for an observer with a relative velocity u, l(1 — u?/c?)? where 
c is the velocity of light, provided the length is parallel to the 
relative velocity. The size of a body measured by a relatively 
stationary observer is greater than that measured by any other 
observer, and is for convenience referred to by relativists as 
its ‘‘ proper” size; but the word has nothing of an absolute 
nature about it. In short, length, area, volume are all relations 
between an observed body and the person observing it. 

Similar remarks apply to measurement of time. Lorentz 
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found it convenient to introduce “local” time for each 
observer ; but as he looked at it this was merely a mathematical 
artifice in order to enable the moving observer (that is, moving 
absolutely in space) to summarise his facts in equations possess- 
ing the symmetrical and simple forms obtained by the observer 
at absolute rest, using true and absolute time. Further, an 
interval of local time between two events depends on the 
interval of absolute time between them and the velocity of 
the locality through the ether. On the other hand, Einstein 
takes “local ’’ time as genuine physical fact and not a mere 
artifice ; further, he holds that no observer can claim that his 
“local”? time is absolute in contradistinction to all others. 
An observer at rest in a locality where two events occur will 
find a certain interval of time between the two events. Any 
other observer in motion relative to the locality will find a 
longer interval of time between the same two events. It is 
convenient to refer to the measurement of the relatively 
stationary observer as the “ proper’’ interval between the 
events, and it can be shown that the measured interval for the 
relatively moving observer is obtained trom the “ proper ”’ 
interval by division with the expression (1 — 1?/c?)?, i.e., the 
connection between the two measurements is solely a matter 
of relative velocity, and independent of any assumed ethereal 
velocity. Once more the time elapsing from one event to 
another is a relation between the events and the person 
observing them. 

It should be unnecessary to remark that these conclusions 
are not in any way dependent on the mere adjustment of time- 
pieces to local “‘ origins ”’ of time, such as the choosing of local 
noons on our earth. And, of course, all material standards of 
length and time are to be such as agree perfectly with each 
other when juxtaposed and 2m relative rest to each other. But 
it may be as well in connection with the example used above 
to note the fact, which rests on the purely reciprocal nature of 
the theory, that as the road is moving westward relative to 
the lorry, two events which are simultaneous to lorrymen are 
not simultaneous to roadmen ; the event occurring at the west- 
ward place is prior to the other for the roadmen (in general, 
prior at the place which is in front of the other with regard to 
the motion of the “ moving” frame relative to the conven- 
tionally “fixed” frame). Also, if the lorrymen measure a 
certain interval of time to have elapsed between two events 
which occur at a place fixed on the lorry, the roadmen measure 
a longer interval between these events, and the roadmen will 
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obtain a shorter distance between two places fixed on the lorry 
than the lorrymen themselves will obtain. 

Before proceeding, let us briefly review the position at 
which we have arrived at this stage of the exposition. If we 
assume that all mechanical phenomena are consistent with 
Newton’s laws of motion, we can, without postulating any 
special properties of matter, explain why there is a relativity 
of all such phenomena (i.e., why none of them can indicate to 
us any privileged frame of reference which can be regarded as 
fixed in space in an absolute sense), provided we agree that the 
equations which transform the co-ordinates in one frame to 
those in another are of the type (1), or aay mathematical 
generalisation of them. On the other hand, if we assume that 
all electromagnetic and optical phenomena are consistent with 
Maxwell’s equations of the electromagnetic field, we can again, 
without any assumptions as to special properties of matter, 
explain why there is a relativity of these occurrences, provided 
we agree that the equations for transforming co-ordinates and 
time are of the type (2) and (3). But we cannot have the 
relativity of electromagnetic phenomena with equations (1), 
nor of mechanical phenomena with equations (2) and (3), 
unless we are prepared to modify the laws of the electromagnetic 
field in one case, or the laws of dynamics in the other ; and in 
any case, a great desideratum for any consistent theory should 
be the avoidance of postulates concerning absolute changes 
in properties of matter which would turn out to be entirely 
elusive of observation, e.g., the Fitzgerald contraction. 

Einstein naturally proposed to maintain the Relativity 
standpoint by retaining equations (2) and (3) which covered 
electromagnetic phenomena, and by modifying the Newtonian 
laws of motion in such a way that they would preserve the 
same mathematical form in all frames of reference. The reason 
why they were consistent with Relativity so long as equations 
(x) are considered true was indicated above—simply because 
the acceleration of a definite body and the force on it main- 
tained the same values in all frames. But this simplifying 
condition is entirely absent if equations (2) and (3) form the 
basis for transformation of co-ordinates and time. Suppose 
the frames of reference are called S and S’, and the axes in each 
chosen so that O’X’ is along OX, and that S’ has a uniform 
velocity « relative to S in the direction OX. Let a particle 
pass through the position (x,y,z) at time ¢ having a velocity v with 
reference to S. With reference to S’, it will pass through 

“(x’, y’, 2’) at time ¢’ where x’, y’, 2’, t’ are connected with ~, y, z, ¢ 
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by equations (2) and (3), and its velocity, which we shall represent 
by v’, will be connected with v by certain equations, which we 
proceed to quote. We shall represent the components of v in 
S by v4, vg, vs, and of v’ in S’ by v,’, v2’, vs’._ Further, it will be 
a convenience to represent the quantity (x — v?/c?)-? by a 
single symbol, say f, and, similarly (1 — v’*/c?)~? by f’. 
Whereas in Newtonian theory we have the very simple equa- 
tions 
DV, = Ugo, Uy == Veg vs es: 


pure kinematic reasoning based on equations (2) and (3) yields 
the following result :— 


p'v,' = a(Bv, — uB) 


B’vq' = Bre; B’v,' = Brg (5) 
B’ = a(B — uBv,/c?) 
where, as before, a = (I — u2/c%)-4 


or the equivalent 


Bu, == B’vy' ; Bus = B'v,' (6) 


Bo, = a(B'vy' + uf’) 
B = a(f’ 4- uB'v,'/c*) 

In fact, the equations of transformation for the quantities 
Us, Pls, Bs, — are precisely the same as thosesior 77 =. 
or, as the mathematician would say, fv,, Buz, Bv3, B are co- 
gredient with x, v, 2, 4. No simple relation between accelera- 
tions in the two frames can exist with these equations for 
transforming velocities ; indeed, a body moving with a uniform 
acceleration in S is not moving with a uniform acceleration 
in S’.. Turning now to the relation between the forces exerted 
on the body as measured in the two frames, it appears that 
immediate progress in the direction of establishing Relativity 
is possible if we leave aside for the moment the older notions 
concerning forces which are the sum of individual forces 
exerted on particles by other particles according to some law 
of distance, and consider forces arising from electromagnetic 
action. If, for instance, the moving particle carries a given 
charge, the force exerted on it in a given electromagnetic field 
can be calculated in terms of the electric and magnetic vectors 
and the velocity. The formula employed is the one due to 
Lorentz and well known to all students of Electron Theory. 
The vectors in S and S’ are connected by equations which we 
have referred to above without quoting, and the velocities by 
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(5) or (6), so that the relations between the forces on the par- 
ticle in each frame can be determined. We use Bits, 2s 
for the force and its components in S, and similar accented 
letters in S’. We also require a symbol for the “ activity ”’ of 
the force on the body, i.e., its rate of doing work on or of adding 
kinetic energy to the body. This is measured by the scalar 
product of F and v; we shall denote it by A. The relations 
are then 


B’F,’ = BF, ; B's’ = BF, 
B’A'/c? = a(BA/c? — upF,/c?) 


BF Y' = o(BF, — uBA/c2) 
(7) 


or, in brief, BF,, BF., BF3;, BA/c? are also cogredient with 
x, Y, 2, t. 

By means of (5), (6), and (7) it is possible to determine the 
relation between force and acceleration in the frame S’ if the 
law connecting them as measured in S is known. It appears 
that if the law in S is one of simple proportionality, then no 
such simple law will hold in S’, and vwice versa. Such a law, 
therefore, does not satisfy the principle of Relativity. But 
after all, the proportionality of force and acceleration is not a 
complete statement of Newton’s laws of motion. The full 
statement of the second law is that force is proportional to the 
rate at which momentum is changed, and as momentum is the 
product of mass and velocity, it is clear that the proportionality 
of force to acceleration is dependent on the further assumption 
(which Newton certainly made) that the mass of a body regarded 
as a measure of its inertia is an absolute quantity which does 
not depend on its condition as regards any other physical 
property, and in particular on its state of motion. But before 
the advent of the Relativity hypothesis physicists had, in 
developing the electrical theory of matter, been compelled to 
abandon the independence of mass and velocity in the case of 
the electron, and Einstein was able to show that a complete 
abandonment of it for any body was required in order to bring 
dynamical laws within the scope of Relativity. In short, if 
Newton’s assumption that the mass of a particle is invariable 
and independent of its velocity be replaced by the assumption 
that in a definite frame the mass increases with the velocity 
in such a way that it varies as 8 or (1 — v?/c?) — +, mathematical 
reasoning proves that the proportionality of force to rate of 
change of*momentum (not, of course, to acceleration) is a law 
of motion which is true in any of the frames of reference, if it 
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be true in one of them. For a velocity which makes the ratio 
vjc small, the difference between the results of Einstein’s law 
and Newton’s is experimentally inappreciable, but in the case 
of the electron, for which v/c is not negligible, this law of 
variation of mass with speed is just what is required to reconcile 
the observed motion with the deductions of theory. 

It is advisable once more to consider closely the difference 
between the new point of view and the old. To be sure, 
Lorentz’s electron theory involves a variation of the mass of 
the electron, and the mathematical formula is identical with 
that of Einstein; but in Lorentz’s formula v is a velocity 
through the ether. An electron with a mass m, when at rest 
in the ether, has a mass mB when moving through the ether 
with velocity v. Its mass changes, but its values for a given 
state of motion in the ether is the same for all observers, no 
matter what are the frames from which they make their obser- 
vations on the movement of the electron. On Einstein’s view, 
the mass is m to any observer who is at rest relative to the 
particle (“ proper’’ mass), and mf to any observer to whom the 
relative velocity of the particle is v. Mass is a relation between 
the observed thing and the observer; absolute motion has no 
significance whatever in connection with it. 

Another striking result follows from Einstein’s law of motion. 
It is found that the work done by a force on the particle in 
increasing its velocity from zero to v is m(B — 1)c?. This is, 
of course, the measure of the particle’s kinetic energy. Hence 
we see that the kinetic energy of the particle is proportional 
to the excess of its mass for the given velocity over its proper 
mass. This proportionality of increase of energy to increase 
of mass is again quite in keeping with the most recent findings 
of the electrical theory of matter, which actually suggest that 
the concepts of energy and mass should be fused into one, 
that even what we call the proper mass of a body is a measure 
of the intrinsic energy of the complex electrical system con- 
sisting of the nuclei and electrons of its atoms. So far from 
opposing such a doctrine, Relativity accepts it and gives it a 
significance which is wider inasmuch as it postulates no special 
theory of the constitution of matter. 

Any student of Physics will realise that if the laws of 
Dynamics and the laws of Electromagnetism can be expressed 
in forms which are valid for any one of a group of reference 
frames in uniform motion relative to each other, then a 
great advance has been made in exhibiting the conformity of 
all physical phenomena to the principle of Relativity. For if 
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the electrical theory of matter is correct in explaining such 
dynamical concepts as inertia and force in terms of groups of 
subatomic charged corpuscles (nuclei and electrons), then it 
follows that the movement of such particles, and therefore the 
properties of matter in bulk, must follow laws which are 
independent of the frame of reference chosen. There is one 
significant exception, however. Gravitation is a phenomenon 
which by reason of its very simplicity and universality has so 
far stood outside any special theory of matter. Moreover, and 
this is a point of greater importance at the moment, the law 
propounded by Newton for its measurement fails to satisfy the 
criterion which is required by equations (7) above. The law 
is, in fact, based on action at a distance, and expresses the 
force in terms of distances between individual particles of 
matter. Shortly after the appearance of Einstein’s first paper, 
and when this anomalous position of gravitation had been 
recognised, Poincaré attacked the problem of modifying the 


Newtonian law so as to render it invariant for all the frames . 


connected by a Lorentz transformation. He showed that a 
modified law which does not contradict astronomical observa- 
tion is possible, and removed the old difficulty raised by La- 
place that gravitation must be propagated with an infinite 
speed, an assertion which would, if true, place gravitation 
outside the Relativity principle at once. But while Poincaré’s 
work was significant from the staudpoint of the restricted 
principle of Relativity, it still left unexplained certain discrep- 
ancies between observation and Newtonian theory, e.g., the 
anomalous motion of the perihelion of Mercury, and, as a matter 
of fact, it was within a few years completely overshadowed by 
a further brilliant generalisation made by Einstein himself, 
who pointed out that the problem of relating gravitation and 
Relativity was intimately bound up with the problem of ex- 


tending the principle of Relativity to cover any kind of rela-_ 


tive motion (not merely uniform) of the frames of reference. 

A few pages back the reader was asked to imagine the 
course of events in a moving laboratory, whose speed along 
a straight track is uniform. Let us now consider events in 
this laboratory if the speed of the laboratory be uniformly 
accelerated, say in an easterly direction. Every one who can 
recall his sensations in a tram or railway car, whenever it is 
starting or stopping, will find no difficulty in admitting the 
following statements. Everything in this laboratory would 
experience a tendency to move through the laboratory towards 
the western wall with an accelerated sbeed. That is the natural 
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way in which a laboratory inmate would express his experiences. 
Of course, outsiders would say that both the laboratory and 
internal objects are moving eastwards, but whereas the speed 
of the laboratory is increasing, that of the internal objects, 
unless they are attached rigidly to walls, ceiling or floor, 
or affected by friction with these, remains unchanged, and 
thus the western wall is overtaking them. The facts are the 
same ; the mode of expression depends on the frame of reference 
from which one views them. In the laboratory a pendulum 
would not hang along the original vertical, but would rest 
along a direction oblique to the floor, approaching the western 
wall nearer at its bob than at its point of suspension, and 
would oscillate about this line if disturbed ; unsupported bodies 
would fall in this direction ; water in a vessel would bank up 
on the western side of the vessel so as to preserve a plane 
surface, but at right angles to this same direction. Everything 
in the laboratory would be affected in a similar way. If friction 
were sufficiently small, loose bodies on the floor and benches 
could only be kept from moving with increasing speed towards 
the western wall by fastening them to the eastern wall or by 
some similar device. In fact, it would not be unnatural for 
the laboratory population to say that for some reason or other 
gravitation was acting towards the western wall as well as 
towards the floor. If no vision outside their laboratory were 
possible they might even imagine that they were actually at 
rest on the earth as before, and that some influence had pushed 
the laboratory up at its eastern side, and so from east to west 
was “ downhill.’ If, however, they investigated the period of 
oscillation of a pendulum, they would find it to be shorter than 
before, and thus their “force of gravitation ’’ would be not 
only altered in direction to them, but would also be stronger. 
Of course, an outsider would refer to the new “ horizontal 
component of gravitation” as “fictitious.” That would be 
quite right from his point of view. But from the laboratory 
point of view, the new component would be quite as real as 
the original, for, in the last resort, gravitation is a name intro- 
duced for the purpose of conveying to the mind a certain com- 
mon feature of a wide range of mechanical phenomena. It 
does not purport to explain the “ why” of those phenomena, 
despite a rather popular, but fortunately fast waning, belief to 
the contrary. To the person in the laboratory everything 
tends to move towards the western wall with a definite accelera- 
tion ot speed in that direction, and that is the unique feature of 
a ““ gravitational field ’’ wherever it is encountered. The imme- 
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diate point is that he can still summarise all his mechanical 
experiences in the laws of motion, provided he uses as his force 
of gravitation, not the force familiar to the person at rest on 
the earth’s surface, but the iorce as modified by the new 
westerly component. Moreover, if another laboratory passed 
by his own, and he could see into it and make observations on 
events there, he would probably perceive that the gravitational 
force in laboratory number two is different to that in his own, 
both in magnitude and direction ; but this would give him no 
clue as to whether his laboratory is at rest and the other is 
absolutely accelerated with regard to his, or vice versa. 

It may occur to the reader that if gravitation can be modified 
in this way by merely transferring our thoughts to another 
frame of reference, it may be possible to “ modify it away 
altogether.’” It certainly is, though if we transferred ourselves 
and not our thoughts to some of the frames the results would 
probably be fatal. Recall the familiar “ sinking sensation ”’ 
when one is in a lift and it is just beginning its descent to a 
lower level. During a large part of the journey, the sensation 
is absent, because the lift is descending uniformly ; but for a 
brief interval at the start the lift is picking up speed, it is being 
accelerated downwards, and during this interval the pressure of 
everything on the floor decreases. In fact, if the gear broke 
and the lift fell freely down the shaft with the natural accelera- 
tion of gravity, nothing on the floor would press on it; if any 
unfortunate person in the lift ceased to hold any article in his 
hand, it would not fall to the floor. No doubt other people 
would say it was falling, and just in the same manner as the 
lift; but they are referring experience to another frame. 
People in the lift would for a short time at all events experience 
none of the familiar effects of gravitation. Nothing would fall 
in the lift; articles in motion would move in straight lines 
through the lift ; the floor would cease to press people upwards— 
the most persistent and universal appeal of gravitation to our 
senses—and very uncomfortable they would be when missing 
this familiar pressure, even apart from the dread of the final 
crash. We can relate this to the moving laboratory, where an 
eastward acceleration of the laboratory relative to the earth 
introduced into the laboratory a westward “ force of gravita- 
tion ’’ which did not exist in the outside frame. So here, a 
downward acceleration of the lift relative to the earth introduces 
into the lift an upward “‘ force of gravitation ” (non-existent in 
our usual frames), and this just compensates the already 
‘existing downward force and so results in the absence of 
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gravitation, i.e, absence of the familiar features of falling 
bodies, curved trajectories of projectiles, supported bodies, etc. 
It is of interest to note in passing that quite the opposite effect 
takes place at the conclusion of a lift descent, i.e., a usual 
descent and not the disastrous episode pictured above. Then 
the lift is slowing up, and this is equivalent to an acceleration 
upwards relative to the earth, introducing a downward “ force ” 
into the lift in addition to the usual one, and resulting in a brief 
increase of pressure of ourselves on the floor, and an increased 
acceleration in the speed oi any unsupported body towards the 
floor. 

Let us abstract ourselves from the fatal surroundings of the 
dropping lift ; let us put the lift—or, for convenience, a more 
commodious room like our original laboratory—out “in 
space.” It will be admitted that if anywhere near the solar 
system, the laboratory would be moving relatively to the parts 
of this system. It might be “ falling towards the sun,” or 
perhaps towards one of the other members, if near enough; 
on the other hand, we could avert the still impending, but 
postponed, disastrous collision by arranging to have it moving 
in an orbit, just like a planet, satellite, asteroid, or comet. 
We will waive the difficulty as regards the subsistence of life in 
such peculiar circumstances. M. Verne and Mr. Wells have 
overcome little troubles like that for our amusement and 
delight. Having, like those intrepid voyageurs, Servadac and 
Cavor, “‘ changed our frame of reference’? not merely to a 
comet or the moon, but to something still more exiguous, an 
“asteroidal laboratory,’’ whose own gravitational influence is 
too feeble to be considered, we can at leisure and without 
unpleasant anticipations study the conditions in our new 
abode. After a time, we would become accustomed to the 
“sinking sensation,” i.e., the absence of pressure on the floor ; 
indeed, there would be no floor. Gravitation would be absent 
in our room; we would have to take great care that an in- 
cautious muscular effort when in contact with any of the walls 
did not project us violently against the opposite wall or result 
in permanent banishment from our new home, if an opening 
in the wall allowed us to pass through to the unpleasant fate 
of becoming a lonely asteroid. In short, our frame of reference 
would be “ gravitation-free.”” We have not necessarily to 
carry our thoughts to enormous distances from suns and stars 
to form the idea of a “place where gravitation does not 
act.2 


We thus can conceive of the existence of frames of reference 
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in which over a limited region gravitation effects are practically 
absent in the sense that accelerated motion relative to this frame 
is not a natural and striking feature of moving bodies as it is 
relative to earth, for instance. If in such a frame we should 
desire to reproduce the familiar features of gravitation, we could 
do so by moving a second frame through it with accelerated 
velocity. On an asteroid, for example, “ natural” gravitation 
would be almost absent, but if our laboratory were running 
carriage-wise along its surface with an accelerated speed, we 
would produce within the laboratory a “ gravitational field ” 
in a direction opposite to the laboratory’s acceleration, this 
being the only gravitational field in the laboratory now, and 
not merely an additional amount as it was when we conceived 
the operation carried out on the earth. The conclusion to 
which this line of thought leads is the relativity of gravitation 
in so far as mechanical effects are concerned. If we have not 
merely to admit inability to measure a velocity in absolute space, 
but also inability to measure absolute acceleration, we cannot say 
how much of our actual gravitational field is ‘‘ real’’ and how 
much is “ fictitious ’’ and due to such an acceleration. Indeed, 
in the absence of such knowledge, the idea of an absolute 
acceleration of the earth in space is as irrelevant for physical 
science as the idea of an absolute velocity. There is, however, 
one qualification to be referred to at this point. We do measure 
a rotation of the earth on its axis with reference to the so-called 
fixed stars. Moreover, we also measure a rotation by means 
‘of such apparatus as the Foucault pendulum, where we do not 
postulate the fixed stars as a frame of reference ; in fact, this 
rotation is the rotation of the plane of vibration of the pendulum 
relative to the objects in the laboratory ; but our minds being 
prepossessed with the idea of an absolute space, it has been the 
custom to refer to the Foucault effect as due to the rotation 
of the earth in an absolute space. It so happens that the two 
rotations agree experimentally, which has led to an assumption 
that we can use the mean of the “ fixed stars”’ as a frame of 
reference which may be treated as absolute in so far as rotation 
is concerned. On such an assumption it is clear that our 
gravitational field would contain a small “ fictitious ”’ part, due 
to centrifugal action, which would be at right angles to the 
axis of rotation and equal to wv where o is the angular velocity 
of the earth and v the distance of the part of the earth con- 
sidered from the axis. The remaining part would, on such a 
view, be the “real” gravitational field (unless there be some 
~ other “fictitious ”’ part due to an acceleration of the translatory 
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motion of the earth in absolute space). Needless to remark, ~ 
such separation of a gravitational field into real and fictitious 
parts is quite foreign to the standpoint of Relativity. The 
field depends on the frame of reference, and the change involved 
in passing from one frame of reference to another must be 
calculated, not in terms of unobservable movements in an ab- 
solute space, but in terms of observable relative velocities or 
accelerations. Thus the two fields referred to above would 
differ from each other by a term depending on the relative angu- 
lar velocity of a frame not turning with respect to the fixed 
stars and a frame, like the earth, which is so turning. It is 
true that the mathematical form for this term has been arrived 
at by reasoning which seems to postulate an absolute rotation 
inadvance. But after all, we are not committed to the absolute 
validity of this form. It is, indeed, like many other results 
deduced from the Newtonian dynamics, a close approximation 
in the actual cases where we make practical use of it, but if 
extrapolated beyond these limits it lands relativist and non- 
relativist alike in the difficulties of conceptually infinite space ; 
and, although this is a point which cannot be treated here, but 
must be deferred until the end of this book, the relativist does 
not make such an extrapolation, but has indicated more closely 
the mathematical method which must be pursued in order to 
obtain the correct relation between the field in one frame and 
the field in a second, having a definite angular velocity relative 
to the first. 

Mechanical effects apparently give no decided evidence on 
the existence of an absolute acceleration any more than of an 
absolute velocity. It is true also that optical and electro- 
magnetic experiments yield no support to the existence of an 
absolute velocity. But there still remains open the possibility 
that optical experiments might provide evidence in favour of 
the view that absolute acceleration is an observable pheno- 
menon. For instance, a ray of light as observed by two people 
each using a frame of reference accelerated with respect to the 
other, would have two different geometric forms whose relation 
to one another could be calculated in terms of the known 
relative acceleration. Now, if a ray of light be observed in 
one gravitational field, and another ray in another field differing 
from the former by an amount equivalent to the relative 
acceleration just mentioned, it is clearly a matter of experiment 
to determine whether the two rays have the same geometric 
form or not, and if not, whether the relation between the forms 
is the same as in the previous case or not. If the forms are 
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the same or if they differ in a manner which is not equivalent 
to the differences introduced by simple change of reference 
frame, then although “real’’ and “ fictitious” gravitational 
fields would be equivalent in their effects on mechanical pheno- 
mena, they would not be equivalent in their effects on optical 
phenomena. What Einstein postulates is that they are com- 
pletely equivalent, and so adds the “ Principle of Equivalence ”’ 
to his previous postulates. 

It follows from this that a ray of light should be deviated 
in passing near a large gravitating mass, just as a particle of 
matter would be if it had the same speed. But the reader must 
be careful at this point. He has just learnt that the behaviour 
of a particle of matter as deduced from Relativity dynamics 
would be very different from that deduced from classical 
dynamics. On the older theory, combined with the Newtonian 
law of gravitation, the principle of Equivalence would yield a 
deviation of a little less than a second of an arc in the case of 
a ray of light passing close to the sun. But Relativity dynamics 
combined with Einstein’s law of gravitation (to which reference 
will be made presently) gives the deduced deviation as just 
double the former amount. So observation is not so much 
concerned with verifying the fact of a deviation, which is quite 
compatible with older views, but in obtaining as exact a 
measurement as is possible. The eclipse expeditions of 1919 
certainly gave considerable support to the Einstein value of 
nearly two seconds, but the matter is so important that further 
verification is eagerly awaited. 

To grasp the full import of Einstein’s law of gravitation and 
comprehend the attitude of mind which rejects the Newtonian 
law as inadequate, and accepts Einstein’s as a more complete 
summary of our knowledge, is impossible without the aid of 
mathematical analysis. Unfortunately for semi-popular ex- 
planation, the sort of analysis required is unfamiliar even to 
those who have taken the customary University courses of 
instruction. It is not that it is especially difficult or full of 
artifices and ‘‘ dodges ’—far from it. It is plain and straight- 
forward enough, but in a word, it has not usually been taught 
in the past, whatever may happen in the future. At this 
stage we can merely state that Einstein’s theory is a hypo- 
thesis concerning the ‘‘ curvature of space-time,’”’ and proceed 
to adumbrate what that peculiar phrase means. 

We are familiar with the treatment of geometrical figures 
on a plane surface by means of Cartesian or polar co-ordinates. 
‘Certain readers may also be aware that there is an analogous 
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treatment of figures on a spherical surface. As a matter of 
fact, there exists a very important branch of geometry which 
treats of figures on any surface in terms of co-ordinates which 
were first introduced by Gauss. Suppose we have a certain 
surface S, and that we consider the curves in which it is cut 
by a surface 


S(%, yy, 2%) =u 


where u is a parameter and x, y, z are ordinary three-dimen- 
sional Cartesian co-ordinates. If we vary the value of the 
parameter u we obtain a family of curves on S. Consider 
another family of surfaces, 


$(%, Y, 2) =v 


where v is another parameter; this gives another family of 
curves on S._ If we assign definite values to u and v, we define 
a point on the surface S, and we can call « and v Gaussian co-or- 
dinates of that point ; the aggregate of the curves formed by the 
intersection of S and the f surfaces, and of S and the ¢4 surfaces, 
constitutes a mesh on S. Of course, we can “‘ transform the 
co-ordinates ’’ by changing the mesh system, i.e., by altering 
the functional forms of f and ¢; so that assigned numerical 
values of « and v would not define the same point on S as 
before. 

Let us consider two points, P and Q, on S, which are very 
near one another, one being defined on some mesh system by 
definite values of # and v, and the other by uw + du, v + dv. 
It can be shown that the square of the element of length PQ 
is equal to 

&,9U? 4- &o.du? + 22, .dudv, 


where £41, 220, 212 are three functions of w and v, whose func- 
tional forms will depend on the form of the surface S and on 
the functional forms of fand ¢. The length of a curve drawn 
on the surface between two finitely separated points A and B 
is given by 
B B 
(= | @rdu® + goado® + 2g, dude)! 

the line of integration being the curve itself. Two very import- 
ant results can now be quoted. First, defining a “ geodesic ” 
as a curve on the surface such that tbe length of it between 
any two points A and B is either greater or less than that of 
all neighbouring curves drawn on the surface between A and 
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B, we find that the equations of a geodesic are two differential 
equations in d?u/ds?, d°v/ds®, dujds, du/ds, in which the coeffi- 
cients are definite mathematical functions of the g14, 209, Zin 
and their differential coefficients with respect to # and v. 
We may alter our mesh system, i.e., alter the functional forms 
of f and ¢ in x, y, z, and thus transform our co-ordinates and 
alter the functional forms of gy), £22, 12 in u and v, yet the 
differential equations of a geodesic retain the same mathemati- 
cal form ; they are ‘‘ covariant.’’ The second result concerns 
the “‘ curvature ’’ of S, which is defined as follows. Consider 
a small element of area surrounding a point P on S, and draw 
normals to S at all the points on the boundary of this area. 
From any arbitrary point draw a series of lines parallel to these 
normals, forming a cone which will cut out an area on the sur- 
face of a sphere described round the arbitrary point as centre. 
If the radius of the sphere is so adjusted that this area is equal 
to the original element of area on S, then S is said to have at the 
point P the same curvature as this sphere, and the quantitative 
measure of this curvature is taken to be 1/R?, where R is the 
radius of the sphere in question. Gauss succeeded in showing 
that this curvature is equal to a certain mathematical expression 
involving the £11, £20, £12, and their first and second differential 
coefficients witii respect to wand v; and, once more, the expres- 
sion is independent of the functional forms of f and ¢ in %, y, z, 
i.e., of the mesh-system or system of co-ordinates. He was 
further able to show that any deformation of S which does not 
involve straining, crumpling or tearing does not alter the values 
of the curvature at each point, so that, for example, the curva- 
ture is zero not only for plane surfaces but for all surfaces 
which can be developed into a plane; and also that if a sur- 
face have finite values of curvature everywhere, it cannot be 
developed into a plane. 

Some years later Riemann put forward the idea that our 
own space might require similar general treatment ; that it 
is not a foregone conclusion that physical measures of dis- 
tance, if sufficiently refined, would agree with the conclusions 
deduced by geometrical treatment based on Euclidean assump- 
tions, and, in particular, on the parallel postulate. It might, he 
argued, be necessary for a more complete treatment of geometry 
to postulate that the distance between two neighbouring points is 
given by a general quadratic function of the three differences ot 
the co-ordinates involving as coefficients six functions of the 
co-ordinates, and not by the simple quadratic form employed 
‘in the customary treatment, viz., the sum of the squares of 
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the co-ordinate differences. It may be as well to warn the 
reader at this point that, of course, if we adopt polar co- 
ordinates, ellipsoidal, cylindrical, or any other variety of 
curvilinear co-ordinates, we do have to employ a general 
quadratic function of co-ordinate differences; but there is 
always implicit in conventional geometrical reasoning the hypo- 
thesis that it is possible to choose a system of co-ordinates, the 
Cartesian, for which the expression takes the form with constant 
coefficients. Now it is just this hypothesis which Riemann 
questions. We certainly cannot do the analogous thing in two 
dimensions on a surface with essential curvature. We can 
only be suie by direct physical measurement of sufficient 
precision that we can or cannot do it in any three-dimensional 
geometry which has to be employed in reasoning about physical 
phenomena. The main conclusion as regards this part of 
Riemann’s work is this. If we consider the mathematical 
expressions which are the natural generalisation for three 
co-ordinates of Gauss’ single curvature expression for two, 
and are, like Gauss’ curvature, expressions involving the six 
g-coefficients now required and their first and second differen- 
tial coefficients with respect to the co-ordinates, then if 
these expressions are zero, space is Euclidean, flat, “‘ homa- 
loidal ;”’ if they are not zero, there is an essential ‘“‘ curvature ”’ 
in space, which may be negligible over a small enough region 
(just as a small portion of a surface may be regarded as nearly 
flat), but cannot be neglected over a large region without some 
contradiction with physical fact arising. 

Finstein’s treatment of the general relativity of space, time, 
gravitation, and electromagnetism depends in general on the 
adoption of the same attitude towards the four-dimensional 
continuum formed by physical events, and in particular on the 
employment of the same mathematical method, adapted to 
deal with four co-ordinates instead of two or three. In the 
previous pages we have seen how the spatial co-ordinates and 
the time co-ordinate of any physical event are more closely 
connected than has been suspected hitherto and how the trans- 
formation of co-ordinates involved in changing one’s frame of 
reference, in general, affects all four. The aggregate of all 
physical events we call “ space-time,”’ or the “‘ world.” Now, 
two events in this world have a certain “interval” or 
“separation ’’ between them which is capable of mathematical 
definition. From an examination of equations (2), (3), or (4), 
will be found that 


c2St? — dx? — dy® — 8g? = c2S1’2 — Bx’? — Sy’? — §2/2 
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where %, y, 2, t and x + dx, y + dy, z+ 82, ¢ + 8 refer to two 
defined events in one frame of reference, and x’, Weel andl 
x’ + 8x’, etc., refer to the same two events in another frame. 
It appears, therefore, that two definite physical events possess 
the property that there is a definite relation between them 
which is determined by the special quadratic expression 


c2dt2 — bx — dy? — $2? 


for this expression maintains a definite value, no matter what 
frame of reference the events are viewed from, provided we 
accept the limitation of uniform motion imposed by the 
restricted principle of Relativity. In order to free himself 
from this restriction, Einstein postulates that this separation 
is a physical quantity independent of any frame of reference, 
and that its complete expression depends on a general quadratic 
function of the four co-ordinate differences. Such an expression 
will involve ¢en terms, and so ten coefficients, 211, £o0, £39 
Lay S12 - - - &34, Will appear all functions of the co-ordinates 
in general. The actual functional forms of these coefficients 
will vary with a change of reference-frame, i.e., with a trans- 
formation of co-ordinates, and Einstein assumes a close corre- 
lation between the changes in these functional forms and the 
change in the gravitational field which is involved, and which 
has been previously pointed out, a correlation which can only 
be stated precisely in terms of mathematical symbols. Einstein 
succeeds in throwing the laws of kinematics, dynamics, and 
electromagnetic theory into equations which involve the 
symbols for the usual physical concepts of velocity, force, mass, 
energy, electric and magnetic intensity, etc., and the ten 
g-coefficients, and these equations preserve the same mathematical 
form no matter what change is made in the reference frame, 
i.e., no matter how the co-ordinates are transformed. Thus is 
the general principle of Relativity satisfied. 

Gravitation has been omitted in the catalogue of terms just 
written. This has been done advisedly, for in Einstein’s theory 
gravitational force has no place as a fundamental concept. In 
Newton’s theory rectilinear motion in an absolute space— 
inertial motion—is postulated as a natural phenomenon, and 
so gravitational force has to be postulated also as an explanation 
why this natural motion is not in general adhered to. Tor 
Einstein there is no absolute space ; but there are “ natural ”’ 
paths for a material particle in any definite frame, such paths 
as, in fact, it would pursue provided it be freed from air re- 
‘ sistance, mechanical shocks, etc. They are not in general 
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rectilinear, but, provided the conditions above are obeyed, they 
are adhered to, and so no influence has to be postulated to account 
for a deviation from them, since there is no such deviation. 
If one asks what is Einstein’s criterion for the “ naturalness ” 
of a path, the request can only be answered in mathematical 
terms. It was mentioned above that on a surface two covariant 
differential equations constitute the equations of all geodesic 
lines, and such lines do possess a unique property which singles 
them out from all possible lines on the surface. Einstein 
employs the four differential equations which are the 
generalisation for four variables of the equations of surface 
geodesics for two, as the equations of his natural paths. Of 
course, these equations single out, in the first instance, lines in 
the four-dimensional world of space and time, “ world lines ”’ 
of particles ; the “‘ projection ”’ of these on any given observer’s 
space will be the natural spatial paths in his frame of refer- 
ence. 

Lastly, what of the law of gravitation itself? Again, . 
the reply comes that it is a set of differential equations, and 
the expressions involved in these equations are twenty functions 
which are the natural mathematical generalisation for four 
variables of the expressions for the curvature of a surface in 
two variables, or the Riemann “curvature components” of 
space in three. If these twenty functions were zero, Einstein 
assumes that either gravitation would not occur in our frame 
of reference, or, at all events, it could be entirely removed by a 
change of frame. But we cannot remove it in such a wholesale 
manner ; we can remove it, as we have seen, over a small 
region, just as we can regard a small element of a surface as 
flat. Hence there is some essential “ curvature ’’ in space-time 
which cannot be entirely removed by any choice of reference 
frame, ie., by any change of co-ordinates. The twenty 
‘curvature’ components are not individually zero. Yet 
there must be some mathematical relation between them to 
correspond to those physical properties which are common to 
our space and time, no matter what frame of reference is used. 
Einstein made a brilliant guess that although the individual 
curvature components are not zero, certain linear functions of 
them are zero, for regions outside matter, and are equal to a 
similar group of functions built on the energy, stress, and 
momentum of matter within matter itself. 

But we have reached the limit where description in general 
terms ceases to be of service, and it is time to take up the 
mathematical development of Relativity in earnest. 
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CHAPTER I. 


WE must now begin a careful study of the conclusions which 
can be drawn from the definite pronouncement of experimental 
research, that however useful the ether may have proved to be 
as a conceptual medium for the propagation of radiation, it has 
failed to ‘‘ materialise” as a system of reference privileged to 
assume the title “ fixed,’ in contrast to all bodies in the 
universe. 

It was Einstein who, in 1905, grasped the full import of this 
failure and realised that all physical laws must be framed so 
as to summarise the results of measurement and observation 
in the same terms, no matter what the system of reference 
might be, in which the observers and their apparatus are 
situated. At the outset of his researches, however, he confined 
himself to the question of framing laws in the form of equations 
which, if valid for a system of reference S, would also be valid 
for any other system of reference moving relative to S with a 
uniform velocity. It is sometimes stated that Einstein has 
“ abolished ” or “ abandoned” the ether. This is one of those 
dangerous half-truths which are apt to create serious miscon- 
ception in the minds of those who are interested in scientific 
research, but are precluded for one reason or another from 
reading the original literature. 

What Einstein pointed out in 1905 was the fact that if one 
adopts the Lorentz equations for transforming from axes fixed 
in the ether to axes fixed in a body moving uniformly through 
the ether, it becomes impossible to decide which of the two 
systems of axes is fixed in any absolute sense ; indeed, it is 
impossible to say if either of them is fixed. This arises from 
the reciprocity to which attention has already been drawn. 

Thus, in considering the formulation of general physical 
laws as differential equations valid for any system of reference, 
the question of the existence of the ether is irrelevant [It is 
of no more service for that specific purpose than any body in 
motion ‘with respect to it. But that does not amount to a 
denial of its existence. Certainly, as will be seen presently, it 
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becomes impossible to reconcile the existence of an immovable, 
“stagnant ” ether with the standpoint of Relativity. 

It is the demand for a formulation of laws valid for any 
system of axes which constitutes the starting-point of the new 
theory. Its first postulate reads : ‘“‘ The laws of natural pheno- 
mena are the same whether the phenomena are referred to the 
framework of reference S to any other framework S’ moving 
relatively to the former with a uniform velocity and without 
rotation.” Or still more precisely we may phrase it thus: 
“The equations which summarise the sequences of natural 
phenomena have the same form when referred to a given set 
of axes as when referred to a second set moving relatively to 
the former with a uniform velocity and without rotation.”’ 

It is most important to observe that Einstein takes as his 
fundamental postulate a statement which appears as a final 
(and to some extent an approximate) deduction of the work of 
Larmor and Lorentz. In so doing he has introduced, as will 
appear in due course, a notable simplification into the analysis 
of dynamical and electromagnetic problems. Even before the 
appearance of his gravitational theory he had shown how a 
number of phenomena, previously explicable by the introduction 
of certain ad hoc assumptions into electromagnetic theory, 
could be easily deduced from the restricted principle. 

In addition to the postulate enunciated above, Finstein 
assumes that 7m vacuo light is propagated with a velocity which 
is determinate for all observers, and is independent of any 
motion which the source may have, relative to any given observer. 

This second postulate, while quite consistent with experi- 
‘ mental results, cannot be readily reconciled with the view that 
the medium of transmission of light is a fixed and stagnant 
medium. It is true that the latter theory assumes that the 
velocity of light is independent of the motion of the source 
velative to the ether ; but it also assumes, on the basis of classical 
relativity, that it is not independent of the motion of the 
observer relative to the ether. As a consequence, the “ velocity 
of light” would refer to the velocity relative to an observer 
fixed in the ether, while the velocity relative to any other 
observer would be obtained by adding to this velocity the 
reversed velocity of the observer through the ether. But it is 
just this deduction from the classical theory which is con- 
tradicted by experimental research, unless one postulates the 
Lorentz contraction for matter in motion through the ether. 
Einstein, on the other hand, makes direct use of the experi- 
mental results, and the second postulate (the principle of 
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constant light velocity) which he bases on them determines the 
relations which must hold between the spatial and temporal 
measurements of different observers. 

At the time at which Einstein began his researches, the 
evidence upon which he based this postulate was the negative 
result of the Michelson-Morley experiment, the principle of 
which was outlined in the previous chapter, a full account 
being obtainable in standard text-books of Optics. But the 
conclusions derived by combining the principle of Relativity 
with the assumption that the velocity of light is independent 
of the motion of its source are so extraordinary, that attempts 
were made to avoid these conclusions by substituting for 
Einstein’s second postulate the hypothesis that the velocity of 
propagation is the vector sum of the real velocity of light and 
the velocity of the source, a return, in fact, to something akin 
to an emission or ballistic theory of light. Such a hypothesis 
would lead to a very simple Relativity theory involving no 
modification of our ideas of space and time, and would yield 
an extremely simple explanation of the result of the Michelson- 
Morley experiment, but being radically in contrast with the 
electromagnetic theory of light, obtained little favour. Still, 
as the existence of a medium for an undulatory propagation 
of light is not vital to the Relativity theory, relativists cannot 
offer such mere lack of popularity as a valid objection to an 
emission or quasi-emission theory of light. So it is necessary 
to refer, even if only briefly, to various suggestions which have 
been made in this direction, and the bearing of experimental 
evidence on them. 

Supposing we adopt an undulatory theory coupled with 
the hypothesis that the velocity of light is affected by the 
velocity of the source in a vectorially additive manner, it can 
be shown that the wave-length of the light is unaffected, while 
the frequency of the light is altered in a manner according with 
the usual Déppler principle. Of course, without this hypothesis 
both wave-length and frequency suffer a Déppler change. 
Now, as the dispersion of a prism is a matter which depends on 
frequency (at least, according to all the theories of dispersion 
hitherto proposed), it would be fruitless to attempt to decide 
between the rival positions by observing the displacement of 
spectral lines by means of prisms. But observations with a 
grating depend on wave-length, and so if the suggested additive 
hypothesis were true, a grating should give no appreciable 

_result in the study of the Déppler effect, and this is not in 
agreement with experience. Experiments such as those of 
3 * 
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Galitzin and Wilip * on the stars and the limb of the sun, and 
of Stark t on the canal rays, agree with the conclusion that 
the velocity of light is constant and independent of the velocity 
of its source. There are two criticisms, however, which may be 
directed against these results. One is that astronomical obser- 
vations of the Déppler effect are not always made with an a 
priori knowledge of the relative velocities of source and observer, 
and in observing the canal rays the measures are of small 
precision. The second is that in so far as mirrors which are 
in motion are employed, the conditions are further complicated 
by the fact that these cannot be compared to moving sources, 
and may produce different consequences. In reply to the 
former criticism, it has been pointed out by Comstock ¢ and 
de Sitter § that apart from the Déppler effect there is evidence 
from the behaviour of binary stars against any form of theory 
requiring the additive velocity, for it would necessitate a 
difference between the observed time of one half-rotation of 
each member round the other, and the time of the other half- 
rotation, amounting to 4vl/c? where v is the velocity of rotation, 
c the velocity of light from a fixed source, and / the distance 
from the earth. Such a result has not been observed, although 
it appears that its detection would be quite possible if it existed. 
As regards the criticism of the use of moving mirrors, emission 
theories differ in their assumptions as to the behaviour of light 
after reflection from a moving mirror. Three assumptions have 
been employed in various researches. The first is that the 
mirror acts as a new source, and the reflected light has the 
same velocity relative to the mirror as the incident light had 
relative to its source. This has been considered by Tolman || 
and shown to be incompatible with experiments on the velocity 
of light from the two limbs of the sun and with measurements 
of the Stark effect in canal rays. The second assumption, 
made by Stewart,** viz., that the velocity added to the reflected 
light is that of the image of the original source (so that the light 
has a velocity c relative to this image), has also been dealt with 
by Tolman, and shown to be in contradiction to the Stark 
effect measurements in canal rays. The third assumption has 


* “Communications Acc. Russe’”’ (1907), p. 213. 

7 “Ann. Physik,” 28 (1909), p. 974. 

{ “Phys. Rev.,’’ 30 (1910), p. 291. 

§ “ Phys. Zeit.” 14 (1913), pp. 429, 1267. 

|| ““ Phys. Rev.” 31 (1910), p. 26; 85 (1912), p. 136. 
** Tbid., 82 (1911), p. 418. 
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been introduced by Ritz * into a rather complete emission 
theory. It is that the light retains throughout its whole path 
the component of velocity which it obtained from its original 
moving source, and hence, after reflection, the wave front 
travels out with velocity ¢ from a point which has the same 
velocity as the original source had at the moment of emission. 
Recently Majorana t has undertaken a direct experiment on 
this question of the velocity of light from moving mirrors, and 
finds that his results authorise the conclusion that reflection 
of light by a moving metallic mirror does not alter the velocity 
of propagation of the light itself, and so disposes of the assump- 
tions (I) and (2) above. But, as Majorana is careful to point 
out, only direct experiments with interferential arrangements 
on the velocity of propagation of light from a source set in 
motion artificially have any bearing on Ritz’s theory, and in 
a later paper { he describes the results of some experiments in 
which the difficulties against endowing a source with a large 
enough velocity (several hundreds of metres per sec.) and 
maintaining the light sufficiently monochromatic are overcome. 
Concerning these and similar results, he says: ‘‘ From the 
researches made by Michelson, Fabry, and Buisson, and by 
myself, it results that the velocity of light is not influenced by 
reflection on mirrors or reflecting surfaces; from those now 
described by me, it results that the said velocity does not 
change by the movement of the source. These facts are surely 
in harmony with the theory of Relativity ; but really, in spite 
of their evident interest, they cannot logically be cited as sure 
experimental proof of this theory. In fact, two experimental 
circumstances must not be forgotten: first, the presence of 
materials which are traversed by the interfering rays (air, glass, 
metals) ; and second, the gravitation field of our earth. While 
it is possible to imagine experiments entirely apart from the 
former, it cannot be foreseen if later experimental results will 
bring into evidence the eventual influence of the second.” 

The weight of all the available experimental evidence seems 
to favour the validity of Einstein’s second postulate, and we 
shall proceed to show how the Lorentz-Einstein equations may 
be derived from it. 

Let us consider two material bodies in uniform relative 
motion to one another, each body serving as a framework of 


* “ Ann. d. Chim. et Phys.,” 13 (1908), p. 145; ‘“‘ Arch. d. Genéve,”’ 

26 (1908), p. 232; “Scientia,” vol. 5 (1909) ; ‘‘ Collected Works,”’ p. 317. 
+ ‘‘ Phil. Mag.,”’ 35 (1918), p. 163; “Phys. Rev.,’’ 11 (1918), p. 411. 
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reference for a group of observers situated on it. Each group 
chooses a set of rectangular axes. Let us denote those for the 
first group by OX, OY, OZ, and those for the second group by 
O’X’, O'Y’, O'Z’.. The co-ordinates of a point in space referred 
to the first set of axes we denote by (x, y, z), and if referred to 
the second set, by (x’, y’, 2’). These co-ordinates are measured 
by standards which are assumed to agree perfectly when placed 
together for comparison. To observe the time of an event, 
each member of a group makes use of a timepiece which is 
assumed to be in perfect agreement with all other clocks 
employed by that group. All clocks 1m one group are syn- 
chronous if the time at which a light signal is despatched by 
an observer A (as read on his clock), and the time of its arrival 
at an observer B (as read on his clock), differ by //c where / is 
the length of AB and ¢ is the velocity of light, this being true 
for all possible lengths and directions of AB in this group. 
This method of synchronising the clocks in one group with each 
other implies that, if a light signal is despatched from a point 
A at time ¢#, (read on a clock at A) arrives at B at a time #, 
(read on a clock at B), and is reflected by a mirror at B back 
to A, returning there at a time #/, (read on the clock A), then 
t, = 4(¢, + ¢#,). A good deal of stress is laid on this process of 
synchronism in Einstein’s original paper, and a critical discus- 
sion of it is contained in Chapter IV. of Dr. Silberstein’s book. 
One essential feature of it, which must be borne well in mind, 
is the employment by both groups of observers of the same 
value for the velocity of light. For by the second postulate 
c is a universal constant (in cm. sec. units, approximately 
Boos Tory. 

By these ideal devices the observers of any given group who 
are supposed to be in relative rest in their framework can 
assign four co-ordinates to an event, viz., the three Cartesian 
co-ordinates of the place (idealised as a point) where the event 
happens, and the time of the event (idealised as instantaneous) 
read by an observer at the place on the clock which he possesses, 
or read on a given clock at the origin on receipt of a light signal 
from the observer, who despatches it at the instant the event 
happens. Of course, in the latter case the reading of the clock 
at the origin must be reduced by the interval required for the 
light to travel from the place to the origin. 

It is only by the employment of such devices that we can 
give any definite meaning to the word “‘simultaneity”’ as 
applied to events in different localities. When two events are 
actually perceived by one person, he relies on his own judgment 
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and skill as an observer to determine whether they are simul- 
taneous or not. Personal judgment in such matters is not 
perfect ; each observer has his “‘ personal equation.’”’ But 
such judgment is in the last resort independent of clocks and 
measuring appliances in general. The matter is quite different 
for events in different places which do not come under direct 
observation by one observer. Two observers, at least, are 
required. Each one must observe a simultaneity (in the per- 
sonal sense) between the event in his neighbourhood and a 
particular reading on his clock. But until the observers have 
communicated their readings to each other they are not in a 
position to make any statement about the simultaneity of the 
two events, or their order in time if they are not simultaneous. 
If this fairly obvious consideration be borne in mind, there is 
no need to fear the so-called paradoxical conclusions of relativity 
concerning time-order. 

We must now determine the relations which hold between 
the space and time co-ordinates (x, y, z, t) of a given event as 
determined by the first group of observers and the co-ordinates 
(x’, v’, 2’, t’) of the same event, as determined by the second 
group. 

It will simplify the analysis a little if we arrange the axes 
OX and O’X’ to be in one line, the relative motion of the two 
frame-works S and S’ being parallel to this line. It will also 
be convenient to assume that each group of observers chooses 
the original instant for time measurement to be the instant 
when O and O’ coincide. Let the S observers measure the 
velocity of the S’ framework relative to themselves as 4, 
ie., at time = 7 in the S framework OO’ = ut, O’ being, of 
course, on the axis OX. 

Suppose that at the instant when O and O’ were coincident 
a flash of light is emitted from a source at this (momentarily) 
common origin. Ata later time, the part of the wave travelling 
along the common axis OX will have reached an observer A 
of the S group situated at the point (x, O, O) and an adjacent 
observer A’ of the S’ group at the point (*’, O, O). Let the 
reading of the A clock be ¢ and of the A’ clock be ?’. 

Fig. 1 represents the state of affairs as it appears to the 5 
group when their clocks indicate 7. 

In the interval ¢, O’ will have moved from coincidence with 
O into coincidence with B where OB = 2#. Consequently, the 
observers in S will estimate the distance from O’ to A’ as 
x% — ut,“because that is the distance between two of their own 
observers who were respectively coincident with O’ and A’ at 
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the same instant, i.e. (as insisted on above), when the clocks 
at B and A recorded identical readings. This is the only method 
available for the S observers to estimate this distance ; for O’ 
and A’ are moving past them. Of course, to the S’ observers 
the length of O’A’ is x’ Now the relation between the lengths 
x’ and x — ut is not one of equality. Such an assumption puts 
us back once more into all the difficulties of classical relativity 
which Finstein’s relativity was designed to overcome. But 
the relation is one which must depend, according to Einstein, 
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on the relative velocity of the two frameworks, and not in the 
least on any supposed absolute velocities in space or through 
the ether of the two frameworks. Owing to the simplicity of 
the physical relation (uniformity of the relative motion), one 
‘naturally expects the mathematical relation also to be simple. 
So we shall assume a simple linear relation and write 


x = a(x — ut). : : ee) 


where a is a constant to be determined. 
But since light travels from O’ to A’ in the time interval 7’, 
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Fig. 2. 


therefore x’ = ct’; for all observers use the same value for the 
light velocity 

Similarly * = ct, and therefore ut = ux/c. 

Hence equation (1) can be written 

ct’ = a(ct — ux/c) 

or UY = a(t — ux/c?) : ; oan) 

Now let us consider the state of affairs as it appears to the 
S’ group of observers. 

This is represented in Fig. 2.* 


* The reader is warned that this is not the same arrangement as 
in Fig. 1. No S observer opposite O’ is put in the sketch, since none 
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An observer B’ of the S’ group is opposite O when the 
former’s clock registers ’, so that O is momentarily located at 
the point (w’t’, O, O) in the S’ framework, where w’ is the velocity 
of the S framework, as measured by the observers in the S’ 
framework. 

Reasoning just as before, we arrive at the equations 


x = a(x’ — wt’ ee) 
Pa ex yey a eG) 


where a’ is a constant. 

Equations (1) and (2) are to be consistent with (3) and (4). 
Hence, by inserting in (3) the values for x’ and ?’ from (1) and 
(2), we obtain 


x =a’ {a(x — ut) — u’a(t — ux/c*)} 
= aa’ {x(t + uu’ jc?) — (wu + w’)t}. 


If this is to be true for all values of ¢ and x, it follows that 


uw’ = — U 
and aa’ == 1/(r + uu’/c?) = 1/(1 — u?/c?), 


This means that each group estimates the relative velocity 
of the other group to have the same magnitude (but, of course, 
opposite directions). It follows as a justifiable assumption that 
a =a’, for each of these constants represents the ratio of the 
length of a rod as measured by observers at rest with regard 
to it, to the length as measured by observers moving parallel 
to it with a definite relative velocity which the latter estimate 
to be w in magnitude. 

Hence it follows that 


a= a = 1/(r — w/c)? = 1/(1 — u’*c*)s. 


These equations embody the idea of the Lorentz-litzgerald 
contraction, but with a marked difference to the original 
assumption. Originally, a rod was supposed to have an absolute 
maximum length when it is at rest in the ether, and this is its 
length for all observers whether they are at rest in the ether or 
not. If it is in motion through the ether in a direction parallel 
to its own length, its length shortens in the 1atio 1: (1 — w?/c)4 
where u is its velocity through the ether, and this length is again 


is required in the reasoning. But in any case it would not be the 
observer B of the previous figure, but one nearer to O than B; ie., B 
would be somewhat to the right of O’ in the figure. 
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independent of any observer’s velocity. But Einstein’s inter- 
pretation of the Lorentz equations assumes the length of a rod 
to be a relation between the rod and each particular observer ; 
it varies from observer to observer. To an observer in relative 
rest with regard to it, the length attains its maximum value ; 
and, theoretically, it may have any value between this and 
zero depending on the relative velocity of the observer to it, 
this result being quite independent of any velocity through the 
ether which may be attributed to it. 

We have still to determine the relations between the y 
and z co-ordinates in the two frameworks. This is obtained 
very readily. Consider once more the emission of the flash of 
light from the common position of O and 0’ at the common 
original instant, and the subsequent arrival of an element of 
the wave-front at a point P. Let d be the distance of P from 
its projection on the common axis as measured by the S group, 
and d’ its distance as measured by the S’ group ; and let x and 
x’ be the respective abscisse of the projection of P on this 
axis. Also let ¢ and ¢’ be the S and S’ times respectively of the 
arrival of the wave at P. Then by the second postulate 


x? + d2 = cif? 
and Me nd. eae 2 


Now if the relations between x, x’, ¢, and 7’ are assumed to be 
as before, it is easily proved that 


x2 — C242 — 4/2 — ¢2z/2, 


Hence, to render the two equations just written consistent, 
we must assume also that d = d’. In other words, the length 
of any body measured transversally to the direction of relative 
motion is estimated to have the same value by the two groups. 
Consequently our complete scheme of transformation is 


x’ = a(x — ut) 
YS eee 
t= a(t — ux/c?), 


or the equivalent scheme 


, 
, 


%= a(t’ — ut) 
Se ee 
t = a(t’ — w’x'jc?), 
where ui = — u 
and a = 1/(1 — u/c?)s. 


lt will be obvious that just as length is a relation of a body 
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to an observer, so also will volume, superficial area and shape 
depend on the relative velocity of body to observer. 

Suppose a body is moving with a uniform velocity w parallel 
to OX relative to the S frame of reference. If the dimensions 
of this body are to be measured by observers in this framework, 
they must contrive to obtain the co-ordinates of a sufficient 
number of points on its surface at the same instant, i.e., each 
point of the surface must have its co-ordinates measured at a 
definite value of the S time, say ¢ units after the original instant. 
Let (%1, Vy, 2;) and (%9, Vo, 22) be the values for two points P, 
and P, on the surface obtained in this way. To observers 
with respect to whom the body was fixed, these measurements 
would be a simpler operation, as they could naturally choose 
a framework in which each point would have invariable values 
for its co-ordinates. Let (x,’, y,’, 2,') and (%,', yq', 22’) be these 
fixed values for the points P, and P,. 


Then since %4' = a(x, — ut) 
Ke = a(x, — ut) 
) ig Sates 5 Mae 
i hee et 
yg 


Yo =Yas 22 = 2e 
it follows that 

Xe’ — X%y' = a(%_ — )) 

Je tit eee Od 

ge 

Since a is greater than unity, x,’ — 4%,’ is greater than 

%_ — %,, i.e., a linear dimension of a body as measured by 
observers to whom it is at rest contains a greater number of 
length units than the same dimension as measured by observers 
to whom it is in motion, provided the dimension is not directed 
at right angles to the relative motion of the body to the latter 
observers. It follows readily that the volume as measured by 
the observers to whom the body is at rest, is a times the volume 
as measured by the observers to whom it is in motion. This 
result is deduced by dividing the body into elementary parallel- 
epipeda, with sides parallel and perpendicular to the direction 
of relative motion, and integrating the result for each element 
of volume. These considerations also involve a difference of 
‘shape as viewed by different observers. A body which is 
spherical to an observer to whom it is at rest would be an 
oblate spheroid to one to whom it is in motion, with its short 
axis parallel to the direction of motion. A body which is 
spherical tqa.an observer to whom it is motion with a definite 
velocity would assume the form of a prolate spheroid to 
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observers to whom it is at rest, or travelling more slowly, and 
that of an oblate spheroid to observers to whom it is travelling 
more rapidly. It will be noted that the ‘‘ Lorentz factor”’ a 
or (I — u2/c*)t, increases without limit as the value of u increases 
toc. Doubtless, so long as we are concerned with such relative 
speeds as occur in terrestrial events, or even in stellar move- 
ments, # is so small compared to c that a differs but little 
from unity. But conceivably the dimensions of a body might 
shrink without limit to those of a thin disc, provided its relative 
speed to the observer were sufficiently increased. It is also 
worthy of remark at this stage that if « were greater than c, 
a would become an imaginary quantity. Later it will appear 
that there are very weighty dynamical reasons for believing 
that relative speeds greater than that of light are physically 
impossible under the conditions postulated in Restricted 
Relativity. 

These results concerning the relativity of shape and spatial 
dimensions are paralleled by a similar result concerning the 
relativity of intervals of time or ‘“‘ temporal dimension.” Sup- 
pose an occurrence which lasts for some time takes place on a 
body which is in relative motion to a group of observers. Let 
us postulate another group of observers to whom the body is 
at rest. The first group employs a framework S, the second, 
S’. How will the first group measure the interval of time 
occupied by the occurrence? One of the group could do it 
provided light signals were despatched to him at the initial 
and final instants of the occurrence, and provided he made the 
usual allowances for the distances separating the body from 
himself at those instants. But this is tantamount to arranging 
two of the S group, each with a synchronised clock, so that one 
is adjacent to the body at the initial instant, and the other at 
the final instant. The difference of the two time-readings 
would give the measure of the interval which is valid for all 
the S group. 

Idealising the body as a point for the purpose of mathe- 
matical expression, let it occupy a fixed position (x’, y’, 2’) in 
the S’ framework, and let an S’ observer also fixed there record 
the beginning and end of the occurrence as taking place at 
times ¢,’ and ¢,’ after the original instant. 

The first of the two chosen observers of the S group will 
occupy a position (%1, 1, 2,) in his framework at the beginning 
of the occurrence, given by 


hoes, , ve 8 
4 = a(% — u't,’) 
. id 

Sp Vy Se B 
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while the second of the two will be in a position (x2, y., z,) at 
the end of the occurrence, given by 


X%_ = a(x’ — u’t,’) 
— (Pe oa a , 
cid Seaeet Scat Wma ie 


But the important feature for our present purpose is not so 
much their positions as their time measurements of the initial] 
and final instants. These are ¢, and ¢, respectively where 


ty = a(t,’ — wv'x’ |c?) 
so that t, —t, = a(t,’ — ty’). 


This result may be translated into words as follows :— 

When an occurrence takes place in a locality, the measure 
of the time interval occupied by the occurrence, which is 
made by observers fixed at this locality, is the fraction 
t/a or (I — u?/c)t of the measure of this interval, which is made 
by observers in relative motion to this locality with velocity wu. 
Or we may put it in a more striking way. Let us take a pen- 
dulum as the material body of the previous reasoning, and a 
complete oscillation of it as the occurrence. Then it will be a 
quicker occurrence to an observer at rest beside it than to an 
observer moving past it, each one judging the time by clocks 
which were originally synchronised. Briefly, a clock in motion 
to an observer goes slower for jim than a similar clock at rest 
to him. 

These results concerning the relativity of our measures of 
spatial and temporal extension are obviously closely connected 
with the broader view which we have to adopt concerning 
simultaneity. Once more this is not an absolute property of 
events in themselves. Events may be simultaneous to one 
group of observers and not to another. This is the result 
which seems so paradoxical to the beginner, if he has not 
realised the nature of simultaneity in connection with events 
in different localities which do not come within the direct 
perception of one observer In illustrating this relativity of 
’simultaneousness from the Lorentz equations as interpreted 
by Einstein, we can deduce a result more general than the 
previous two; for the first of these deals with a purely spatial 
measurement at a given instant of time, while the second refers 
to a pure time measurement, the place of the measurement 
being fixed. Such operations are far greater abstractions than 
the ordinary man would be willing to admit; all measurement 
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is much more a blend of space and time measurement than 
appears at the first glance, and the cogency of this view will be 
brought home to us as we proceed. Consider, therefore, the 
interval of time occupied by an occurrence which begins at 
one locality and ends at another, or, to put it in a manner 
suitable for mathematical analysis, the interval between an 
event (regarded as instantaneous) occurring at one point and 
an event occurring at another point. 

The points P, and P, have co-ordinates (%,’, 1’, 2;'/) and 
(%»", Vo", 29’) in the framework S’ in which they are fixed, so that 
these co-ordinates do not alter with lapse of time. One event 
occurs at P, at the instant ¢,’ units after the original instant, 
as measured by an observer at P, who is fixed in S’, the other 
occurs at the instant ¢,’, as measured by an observer fixed at 
P,. (These events would be regarded as simultaneous by all 
observers in S’, if it transpired that ¢,’ = 7,’. That is the only 
criterion of simultaneity for this group of observers.) 

How will these events be regarded by observers fixed in 
another framework S in relative uniform motion to S’? As 
before, any measurements of time made by observers in S will 
agree with measurements by two special.S observers, of whom 
one is at P, at the moment the first event occurs there, and the 
second is at P, when the second event occurs there. The 
position of this first observer and his measure of the instant, 
in short, the S co-ordinates of the first event, are given by 


My = O(%, — Wt, ) 
==, PAT panes , 
Se = Vi CES 
/ 
t, = a(t,’ — u'x,'Jc*). 


The S co-ordinates of the other event are given by 


Ne = O(%y = UT) 
Yo= Ne ; 
tale. 
Hence 

to —t, = alt,’ — t,’) + au(x,’ — x,')/c? 


where T is the time required for light to travel a distance 
equal to the difference between x,' and x,’. (The double sign 
is necessary ; we use the plus sign if x,’ >x,, and the minus 
sign if 43 < %,'.)* 


* Remember that uw’ = — y, and yu is essentially positive in the figures. 
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In general, therefore, if ¢,’=7,’, then ¢, + ¢,, i-e., one event 
occurs at P, earlier or later than the other at P,., as viewed by 
S observers. 

It appears that if x,’ >-x,', T is positive and ¢,>2,. This 
means that if two events are simultaneous to observers who 
are fixed in the same framework as the places where the events 
happen, then for observers to whom the places are in uniform 
motion, the event happening at the place which is ahead of 
the other in its relative motion to these observers occurs later 
than the other. 

Indeed, it is possible to deduce an apparently more para- 
doxical result. For it is quite possible that ¢,’ might be greater 
than #,’, and yet less than ¢,’ + uwT/c?, and in such a case ¢, 
would be less than ¢, (adopting the upper sign), i.e., the order 
of the events in time would be opposite for the two groups of 
observers. 

Pe esuould be noted that if 7,=—«,, then 7,’ =,’ implies 
t, =t,. Soif the relative motion of the one group to the other 
is perpendicular to the line joining the points, then events 
occurring at these points which are simultaneous to the one 
group are simultaneous to the other. 

Before leaving this matter, we must once more take careful 
note that statements Jike these, which are such a blow to our 
preconceived notions of simultaneity, refer to events happening 
at different places. They do not refer to events happening at 
one place. In such a case we should equate %y’, yy’, 2,’ to 
%_', Vo, Zq respectively, and so obtain the equation arrived at 
earlier : 

bn — t= at, — 7; ). 


This, of course, leads to a different measure of the interval 
between the events, as made by each group (a result dealt with 
above), but it does not disarrange their order, for 


i St, 
according as 
Piet, 
We shall now deduce a result concerning angular measure, 


which will prove serviceable in the succeeding chapter. 
Let the line P,P, make an angle 6’ with O’X’, then 


tan 0’ = (ye — y1')/(%2' — %1’). 


(It is assumed, for convenience, that P, and P, are in the 
O’X’Y’ plane.) 
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P, and P, being fixed in the S’ framework, we know from 
our earlier results that their simultaneous positions (for S 
observers) in the S framework are connected with the S’ 
co-ordinates by the relations 


(%9' — %4') = a(%_ — %4) 

By Pim gh as ae eee 
Hence, as the line P,P, moves through the S’ framework, it 
remains constantly parallel to itself, making an angle @ with 
OX, given by 


tan 0 = (vz — ¥3)/(%_ — %;) =a tan @. 


It is readily seen from this result that the acute angle made 
by the moving line with the direction of its relative motion to 
the S observers is Jarger than the acute angle as measured by 
the S’ observers, to whom it is at rest. An exception, of course, 
arises when the angle is right, in which case 0’ = @ = a/2. 

In subsequent chapters we shall introduce a little simpli- 
fication into the Lorentz equations of transformation by adopt- 
ing as the unit of time, not the second, but the time in which 
light im vacuo travels over our unit of length. Thus if we 
employ the centimetre as the unit of length, the proposed unit 
of time would be 4 x 10~1° second approximately. This wil! 
involve writing u where we wrote u/c above; in fact, u will 
really stand for the ratio,of the velocity of relative motion of 
S and S’ to the velocity of light ; also, we shall write ¢ where 
we previously wrote ct. By so doing we rid ourselves of the 
necessity of using a symbol, c, for the velocity of light. In 
these “relativity”? units the equations of transformation 
become ; 

x = a(x — )) 


SV 2 = 2 : : = tS) 
an 
and 
% = a(x’ — u’'t’) 
Y= eH? - (6) 
t = a(t’ — u'x’) 
where u=—y’ 
and a == 1/(r — w?)t, 


The formal similarity of the « and ¢ equations in each group 
is more marked than before. 
It can easily be shown from (5) or (6) that 


t? Eas x2 ian a pee, 2 — {2 = x2 aS va as gz’? : (7) 
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or, more generally, if two events are recorded in the frame S 
by the co-ordinates (% , V1, 21, t) and (Xo, Vo, 22, t3), and the same 
two events are recorded in the frame S’ by the co-ordinates 
fa 1s 2, 2) and @,; Vo, 22’, fa'),-then it can easily be 
proved that 


ees (hg 4 3)* — (Va — 2)" — (Ze — 2)? 
Bee (eg A)? Py — 9)? 2 = 21)? (8) 


Either member of (8) is a measure of the square of the 
“separation’”’ between the events, and (8) is the mathe- 
matical expression of the fact that the separation is the same 
in all frames. 

It will be observed that if we choose a frame S, such that 
ate, V, = Vs, 2y = 2a, 1.€., if the frame is such that the 
place of occurrence of the events is fixed relative to observers 
in this frame, then the separation is equal to the interval of 
time between the events as measured in this frame, the 
“ proper ”’ interval. 

In the analysis of this chapter the axis of OX has a fictitious 
importance thrust upon it. This arises because of our choice 
of it as parallel to the relative direction of motion of the two 
frames. Ina later chapter it will be shown that by abandoning 
this limitation a very elegant and symmetrical form can be 
given to the Lorentz transformation equations, still consistent, 
however, with the very important results (7) and (8). 


CHAPTER If 


In the previous chapter we developed the Lorentz transforma- 
tion equations on the Relativity hypothesis combined with the 
assumption of constant light velocity for all observers. In 
this chapter we shall apply these equations to certain kine- 
matical problems, whose treatment on classical principles is 
familiar to all physicists. 

If a body is moving with a uniform velocity relative to a 
frame S, it will also move with a uniform velocity relative to 
the frame S’, but the relation between the two velocities is 
by no means so simple as that obtained by the familiar method 
of compounding velocities. 

Idealise the body as a particle, and suppose that it occupies 
the position (%, y, 2) at the time # in the frame S and the position 
(« + dx, y + dy, z+ 82) at the time? + 6¢. Its velocity in the 
S frame has components v,, vy, v, which are the limits of 84/8, 
dy/dt, 52/52.* 

This same particle occupies a position (x%’, y’, 2’) at the time 
t’ in the frame S’ where 


x’ = a(x — ut), 
ve = - 2 =ES a» 
t’ = a(t — ux), 
and a position (x’ + 6x’, y’ + dy’, 2’ + 82’) at the time ¢’ + &’, 
where 
x’ + dx’ = af{(% + dx) — u(t + 82)}, 
y’ + by’ = vy + by; 2’ + 82’ = 2z + 82, 
t’ + Ot’ = af{(t + dt) — u(x + 8x)}, 
so that 


oy == Oy: 02) = 07 
dt’ = a(dt — udx) 


By dividing the first three equations of (1) by the fourth we 
obtain 


6x’ = a(dx — dt) | 
(x) 


* It should be borne in mind that v is the ratio of the velocity of 
the particle to the velocity of light. 
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Vy = (v, — u)/(I — ae 


v,/a(I — wry) 
v,/a(I — uv,) 


(2) 


= 
Hil ll 


Similarly, 


vy, = v,'/a(I — w’v,’) 
Vv; = 0,'/a(I — u’v,’) 


(24) 


v, = (v,’ — w)/(I — mn 


where v,’, v,’, v,” are the three components of the velocity of 
the particle in the frame S’. 

Later we shall see how these equations can be put in a form 
analogous to the equations by which we transform co-ordinates, 
but as they stand they are more suitable for developing a 
graphic method of compounding velocities in a manner con- 
sistent with the Relativity principle. 

The familiar problem of relative motion is the following : 
There are three bodies K,, K,, Ks, such that the velocity of 
K, relative to K, (i.e., as measured from K,) is known in 
direction and magnitude; also the velocity of K; relative to 
K, (as measured from K,) is known. It is required to find 
the velocity of K, relative to K, (as measured from K,). 

The classical solution represents the velocity of K, relative 
to K, by a line AB, and the velocity of K, relative to K, by a 
line BC, and asserts that AC represents the resultant velocity 
or velocity of K; relative to K,; and this would still be the 
correct solution even in Relativity kinematics were the two 
component velocities measured by the same observer or in 
the same frame. But in all cases of composition of velocities 
which are of any physical importance, this condition is not 
satisfied. To solve the problem above, we appeal to equa- 
tions (2). 

Suppose K, to be fixed in the frame S, and K, fixed in the 
frame S’, then if v, is the velocity of K, relative to K,, and 
Vv» is the velocity of K; relative to K,,v; =u= — u,v, =’, 
and the velocity of K; relative to K, is v. 

We make use of (2A) and notice that 


— u'v,f = Uv,' = 04V, cos 9, 


i.e., the so-called ‘‘ geometric” product .of the two velocities, 
or the product of one and the resolved part of the other along 
it. It will be convenient to denote it by the usual symbol of 


vector notdtion, viz., 
(v1. V2). 
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Examination of the equations shows that the relative 
velocity of K, to K, is the ordinary vector sum or resultant of 
the three parts, 


v,/{t + (v,.V.)} parallel to the direction of v,, 
the resolved part of v, parallel to v, 
EVs Vy) 
the resolved part of v, perpendicular to Vv, 
L +: (V,.¥,) 


Hence it is easy to derive the construction given in F ig. 2 
AB represents the velocity v, and BC represents the velocity 


, 


and L079)? 


M 
Fic. 3.—In the diagram AB represents a velocity equal to o°8 the velocity of 
light ; AD represents a velocity 0°6 that of light. 


EM = (x — (‘8)2)} CM = °6 CM 
GN = (1 - (-6)2)! CN = ‘8 CN 
AF = AH = AE/1°48 = AG/r°48. 


V,. From C drop a perpendicular CM on AB. Then the angle 
CE Mia 0: 


Take a point E on CM, such that 
EM = CM/a, = (1 — v,?)* CM. 
Join AE and take a point F on it such that 
AF = AE/{r + (V,. V,)}. 


Then AF represents the relative velocity of the body K, 
tous. 
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It is of vital importance for the appreciation of this result 
to keep in mind the fact that the two velocities v, and v, are 
not measured by the one observer. It is forgetfulness of such 
important features which gives a bias to one’s mind, accus- 
tomed to the older ideas, to regard a result such as this as a 
paradox. Incidentally it will be noticed that the angle @ is 
measured in the frame S’ relative to which K, is at rest. 

It will be a serviceable exercise if we interchange the 
velocities, and suppose that K, moves relative to K, with a 
velocity v, and K; relative to K, with a velocity v,. 

The velocity of K, relative to K, is not the same as before. 
The construction would require us to take the other two sides 
of the parallelogram ABCD, drop a perpendicular CN on BD, 
and take a point G on it such that 


CG — CNi/a,— {7 — 7,7)? CN. 
Choosing H on AG, so that 
AH = AG/{r + (v1. vy)}, 


it appears that AH represents the required relative velocity. 

This latter construction can be viewed in another way, in 
connection with the first arrangement of the velocities, where 
K, was moving with velocity v, relative to K,, and K, with 
velocity V, relative to K,. Reversing this we should say that 
K, was moving relative to K; with a velocity equal and opposite 
to V,, and K, relative to K, with a velocity equal and opposite 
to V,. On carrying out the construction, starting from the 
point C, it is easy to see that the velocity of K, relative to 
K, is represented by a line equal and parallel to HA (in the 
sense of the letters as written). 

This is another of those results which appear paradoxical 
if one forgets the fact that the relative velocity of K, to K, is 
measured in a frame in which K, is fixed, while the relative 
velocity of K, to K, is measured in a frame in which K; is 
fixed. It appears that the observers do not “ get the same 
line for their relative motion ” (as it was once put to the author 
in a crude fashion by a critic). More precisely, angles between 
direction of relative motion and other definite lines are not 
measured alike by two groups of observers. But the result 
which was demonstrated at the end of the previous chapter 
has prepared us to accept such a conclusion as no paradox, 
but quite in accordance with Relativity ideas. 

’ There is, of course, one thing which is measured alike, and 
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that is the magnitude of the relative velocities of K; to Ky 
and of K,toK;. Acontrary result could hardly be anticipated, 
in view of the fact of which we have made constant use in the 
treatment of the S and S’ frames, that w and w’ have the same 
numerical value. In the present instance the result is easily 
proved ; for 
AE? = (v, + v2 cos 6)? + (1 — v4?)vQ" sin?0 
=v? + v2? + 2v,v,cos 8 — (vv, sin @)?, 


and AG? can be shown to be equal to the same function of v, 
and v,. Hence, since AE is equal to AG, so also is AF equal 
to AH. 

In short, although the order of addition of two veloci- 
ties makes no difference to the magnitude of the resultant, 
it does make a difference to its direction. But, to reiterate 
the important fact, it is implied that different observers are 
involved in the measurement of the component velocities. If 
one observer wanted to add the two velocities of two bodies 
both measured in his own frame, he would do so in the familiar 
way by the parallelogram method, but as far as the author is 
aware, such a result has no physical significance whatever. 

In the figure the velocities are pictured as making an acute 
angle one with the other. Were the angle obtuse, (v,.‘V.) 
(= vv, cos 6) would be negative, I -+ (Vv, .Vv.) would be less 
than unity, and AF would be longer than AE. If the angle 
were right E and F would coincide. 

An interesting case arises when Vv, and v, have the same 
direction, so that (V,.V.) =1v,v,. The magnitude of the 
resultant velocity is then 


(01 + Ve)/(I + V2) . « s (2B) 


As a consequence the resultant has a magnitude less than 
the numerical sum of the components. It is interesting in this 
connection to recall the fact that the introduction of velocities 
greater than that of light would introduce imaginary factors 
into the equations of transformation. A glance at the formula 
just written will show that the composition of two velocities 
less than that of light will produce a resultant with a magnitude 
less than the speed of light, for if v, and v, are each less than 
unity, so also is (v, + v,)/(I + v,v.) less than unity. If v, 
is equal to unity, the resultant is unity no matter what value 
Vz has, or the addition of the velocity of light to any velocity, 
equal to, greater than or less than that of light produces a 
resultant equal to that of light. This is true also if the velocities 
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are inclined at a finite angle to each other. Reference to Fig. 3 
will show that if v, = 1,-then E coincides with M, and F with 
B. If anyone were bent on mere mystification, he could make 
a great feature of the result that with both velocities greater 
than that of light the resultant would be less than that of 
light ; for, to be sure, if v, and v, be individually greater than 
unity, (v; + v,)/(I + v,v,) is less than unity, and decreases to 
zero as a limit as v, and v, increase without limit. But this is 
mere symbol juggling, because in assuming velocities greater 
than that of light we have outstepped our conditions of 
transformation, which accepted light signals as the suitable 
means of communication between the two frames for the 
purpose of establishing relations between physical measures. 
As already stated, we shall become acquainted later with 
dynamical reasons for concluding that velocities greater than 
that of light cannot occur under the conditions postulated in 
restricted Relativity. 

We can now return to the equations (2), for the purpose of 
expressing them in a rather more convenient form. It will be 
advisable at this stage to introduce once more the concept of 
the “‘ proper time”’ of the particle, the term introduced by 
Minkowski, who was apparently the first to realise its im- 
portance. By squaring equations (1), we can easily show that 


5t’2 — bx’? — dy’? — §2’2 = St? — dx? — dy? — 82? (3) 


and hence that 
8¢’(r — v’?)t = dt(r — v?)}, 


Supposing, therefore, in the frame S, we integrate the 
expression (I — v?)#df along the path of a particle between 
two defined positions, the result will be the same as a similar 
integration will yield when carried out in any other frame 
moving with a uniform velocity with respect to S, Such a 
quantity which retains the same value in all the frames we refer 
to as ‘“‘invariant.”” The particular invariant in question is 
called an interval of ‘‘ proper time ”’ for the particle. Its suc- 
cessive elernents might be arrived at thus. During any element 
of its path, the particle will be momentarily at rest in some 
one of the set of frames S, S’, etc. Since in that frame the v is 
zero at the moment, the actual element of time measured in 
that frame will be an element of the proper time of the particle ; 
for the fattor (1 — v)t is unity if v= 0. Hence if we denote 
‘the proper time of a particle from any arbitrary position on 


56 RELATIVITY 


its path as 7, then 
Sr = St(r — v?)t = Si/(x — v’*)} = etc. 


It will be convenient to introduce the symbol f to represent 
the quantity 1/(1 — v®)!, which is, as it were, the Lorentz factor 
for the S frame and the (momentary) “rest ’’ frame of the 


particle ; so that 
OL pi=107; 


is an elementary magnitude which is invariant for all the 
frames.* It will be seen that the proper time of the particle 
between two positions is also the separation between the events 
which are its passage through the first and its passage through 
the second position. 

Dividing each side of equations (1) by 67 and going to the 
limit, we have 


dx’ |dt = a(dx/dt — udt/dt) 
dy'/dt = dy|dt ; dz'/dr = dz[dt 
at'/dt = a(dt/dt — udx/dr). 


These equations imply that x, y, z, ¢ and x’, y’, 2’, v’ are 
conceived to be functions of the invariant proper time t measured 
from a definite point of the path of the particle, and we see 
that the four differential coefficients of the four co-ordinates 
of position and time with respect to the proper time are trans- 
formed from frame to frame just as the co-ordinates themselves 
are. It will be convenient to introduce the fluxion notation 
of dots to indicate differential coefficients with respect to the 
proper time. After a little practice, it becomes quite easy to 
guard against confusing the dot with the operator d/dt (or, as 
it really should be written, d/d¢, since in this connection ¢ is 
no longer an independent variable, but just on the same 
footing as x, y, 2, viz., a variable depending on Tt). So we 
re-write the equations just developed thus : 


if = alt = ul) 
y =p, tae (4) 
t’ = a(t — ux) 


The quantities %, y, 2 have the dimension of velocity, but 
they are not the velocities of the particle in any definite frame. 


* No confusion should arise between a and 8. wu and a are used in 
connection with the relative velocities of two frames while v and £ are 
used in connection with the velocity of a body relative to a frame, 
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They are a sort of “ blended” velocity obtained by using the 
elementary displacement of the particle in one frame with the 
corresponding element of time as measured in the (momentary) 
“rest ’’ frame of the particle. 

It is very easy, however, to express these dotted co-ordinates 
in terms of the velocity in a given frame ; for 


6x/d7 = Bdx/dt 


and so 
x = Bv,, 
y = By,, 
z= Bv,, 
and, of course, bp. 


Hence we can write the transformation equations for velocities 
in the form 


(44) 


showing that 
Bu,, Bry, Buz, B 


or v,/(I — v?)?, v,/(I — v?)?, v,/(1 — v?)t, 1/(1 — v2)t 


transform like the co-ordinates, or are “ cogredient ”’ with the 
co-ordinates. 

When we come to the discussion of dynamical principles on 
the Relativity hypothesis, we shall have to deal with the accelera- 
tions of bodies in a frame, and shall require the transformation 
equations for the accelerations of a given body in different 
frames. The explicit form of these equations is somewhat 
cumbersome, but they are implicitly contained in equations 
which are directly derived from (4) by a further differentiation 
with respect to the proper time. For just as differentiation of the 
four space-time co-ordinates of a particle with respect to the 
proper time gives a tetrad of quantities cogredient with the 
co-ordinates, i.e., subject to the Lorentz transformation, so 
a differentiation of this tetrad with respect to the proper time 
will give a further tetrad of cogredient quantities. Indeed, 
it is this property which makes proper time such a very service- 
able concept, viz., the derivation by differentiation with respect 
to it of successive tetrads of quantities cogredient with the 
co-ordinates, and this property clearly depends on the invari- 
ance of tie element of proper time, no matter the frame in 
which one elects to measure it, 


58 RELATIVITY 


rea as 
pie ery aa : ats) 
t’ = a(t — ux) j 


which, as stated, contains implicitly the relations between 
accelerations in each frame of the same particle; of course, 
x, V, 2, or d*x/dr*, etc., are not acceleration components in any 
frame, any more than 4,9, 2 are velocity components in a 
frame. But it is easy to express them in terms of the true 
accelerations in a frame. To begin with 


i = dB/dt = BdB/dt = Bd(x — v?)?/dt 
= $Bdv?/dt + (1 — v*) 
== P4(0,4, + Vyay + 0,a,) . : - (6) 
where ay, ay, a, are the components of the acceleration, for 
dv?|/dt = d(3v,2)/dt = X(vxdv,/dt) 


The bracketed expression in (6) is the geometric product of 
velocity and acceleration, and may for convenience be denoted 
in future by the usual vectorial symbol (v .a), so that 

P= pt(v <a) : : . (6A) 

Turning now to x, y, z, we have 

# = Bd(Bo,) (al 
= Pdv,/dt + Bv,dB/dt 
= Baz + Bv,(v . a)} 
9 = Bay + Prvy(v . a)} 
z= BXa, + Prv(v.a} .  . . (7) 

If we write 5 to denote the quantity 6?(v . a), we see that the 
four functions of velocity and acceleration, 

B*(ay + bvx), Bay + buy), B%(az + bz), B% 


are cogredient with the co-ordinates. Hence for the purpose 
of transforming accelerations from the frame S to the frame S’, 
we have as a convenient form of (5) (or rather (5) with the sys- 
tems interchanged, i.e., the equations reciprocal to (5)) : 


aB’*{a,’ + b'(v ’— u')} 
Bay + bo," ae: 


B’2{b'(r — u'v,’) — u'a,'} 


similarly, 
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_ Equations (8) can be applied to a special case of some 
importance. Suppose the frame S’ to be the momentary rest 
frame of the particle, so that v’ = 0, then the acceleration as 
measured in that frame is called the “‘ rest-acceleration.”’ (The 
concept has been used in pre-Relativity days in connection 
with what has been called the ‘‘ rest-mass”’ of an electron, 
1.e., the quotient of the force exerted on it by its acceleration 
when its velocity is vanishingly small.) Since S’ is the rest- 
frame of the particle at the moment, it follows that 


~ 


o — 0,1 = 7 and 0 = 0, 
— + — , , — — 
Also, 0 = Uz, Since ty = Pv,’ [Bp = 0 = 9; 
u=—u=—v 
i 
and 0 == BE ,, 


Hence by (8) 
Ba, + Biv?a, = Bay’ 


which reduce to 


Pa, = ey 
Bray = a,’ 
Ba, a az! 


(the first and fourth of the above being identical), or 


Ay = a, (I — v?)! 
(9) 


Ay = dy (I — v*) 
Aa O40") 


It should be noted that our choice of axes makes OX 
parallel to the direction of relative motion of S and S’. Hence 
that axis must also be parallel to the direction of motion of 
the particle at the moment in S. So we can express the result 
(9) independently of any axes by stating that 
the tangential acceleration in S = (1 — v?) x tangential rest- 

(acceleration, 
and the normal acceleration in S = (1 — v?)! x normal rest- 
: {acceleration. 


It is one of the first exercises presented to a beginner in 
kinematic3 to study the rectilinear motion of a body with 
uniform acceleration, It is not without interest to work out 
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corresponding results for the corresponding problem in Relativity 
kinematics, viz., the rectilinear motion of a body which has a 
uniform rest-accelerationinaframeS. Write g for this constant 
value, and assume that the motion is parallel to the axis OX. 
Then in (9) we put a,’ and a,’ equal to zero, and a,’ = g. 


Hence 
ty = =O 
and ay = g(t — 0}, 
Ke., dujdt = g(r — v*)? ; : A 820) 


Integrating, we obtain 
v/(x = v)t = gt, 


choosing the origin of time to be the instant at which the body 
began to move. 


Hence 
axfdt =v = gi/(x + g7t)t : Poa 9) 


Another integration yields 
gx = (I + g*t*)t 


if we choose the origin O to be such that the body began to 
move from the point * = I/g. 


Since 
a? —- [4 == (ip)? : Aa) 


it is clear that if one constructs a distance-time graph of the 
motion in the usual way, the graph is not the parabola of the 
uniformly accelerated motion of classical kinematics, but an 
equilateral hyperbola, whose semi-axis is 1/g, and therefore 
varies inversely as the value of the rest-acceleration. 

It is of interest to note that by (10) the value of the actual 
acceleration in the frame S decreases to zero as v approaches 
unity (i.e., the velocity of light). Equation (11) also shows 
that unity is the limit of v as ¢ approaches infinity ; and (12) 
shows that as x and ¢ approach infinity, their difference 
approaches zero, a fact which is also embodied in the graph, for 
at great distances along either branch of a hyperbola there is 
approximate coincidence of the curve and its asymptotes. 

The following three results are easily verified : 


be (ee )s se 
a) (C42) e ae ; . (13) 
(Reena 


It is not unnatural for anyone glancing at these equations 
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for the first time to be a little puzzled by their apparent lack 
of consistency with the well-known rules concerning dimensions 
of physical quantities. (The second of (13), e.g., looks “‘ upside- 
down.”) This arises, of course, from our special choice of 
units. It is clear that since time is expressed in units which 
are I/c of a second, accelerations will be expressed in units 
which are c? times larger than the usual C.G.S unit of accelera- 
tion. The simple “ dodge’ for converting all our kinematic 
formule in relativity units into formule in ordinary (C.G.S.) 
units is to replace 


tin the relativity formula by ct, 
wuandv ,, » » » Ue, v{C, 
a and ix ” ” ” ? ajc?, gic. 


Hence in usual units the results (10), (11), (12), and (13) 
above would become 


u 


dv/dt = g(x — v?/c?)3 : ; .  (IOA) 

v = cet/(gt? + c%)t . = 3 (EDA) 

Sas oat hl (ahd he ; : : » (ZA) 
ee ee HE) 

(De Or 2 ; (134) 
a= Gg fo od | 


The constantly recurring feature of Relativity that the 
velocity of light is a limit for all physically possible velocities 
leads us to make note of a fact which, in a generalised form, 
plays a very important part in gravitational theory It is 
clear that for a given differential increment of the time in a 
frame S, the corresponding increment of proper time of a body 
decreases steadily to zero as the speed of the body approaches 
that of light, for 

or = dt(I — v?). 


This is simply pushing to its logical conclusion the result 
obtained in the previous chapter, that events taking place in 
a frame S’ appear to “‘ go slow”’ to observers in S, and to 
“slow up” more and more as the relative speed of S’ to S 
increases ; so much so that if the limiting velocity of light were 
attained, there would be no progression of events in S’ at all 
as far as observers in S could see, assuming that they could see 
at all in such wildly impracticable circumstances. Professor 
Eddington puts this in a picturesque way when he says: “ It 
man wishes to achieve immortality and eternal youth, all he 
has to do is to cruise about space with the velocity of light. 
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He will return to earth after what seems to him an instant to 
find many centuries passed away.” 

This can also be seen by reference to the result proved 
earlier in the chapter, that addition of any velocity to the 
velocity of light in a given direction alters neither the magnitude 
nor direction of this latter velocity ; which means that if a 
body had the speed of light relative to an observer in a given 
direction, all bodies moving about over this body would have 
the same velocity to this observer, i.e., would appear to be 
at rest on it to this observer. So “‘ nothing would happen ”’ on 
the body for this observer. 


APPENDIX TO CHAPTER II. 


As pointed out in the first chapter, although optical effects 
depending on the motion of an observer through the ether have 
not been rendered evident in physical experiments, there are a 
number of such effects which depend on the relative motion of 
the observer and the source of light. Three of these can be 
very readily treated by means of the formule and results de- 
veloped in the last two chapters. 

Consider, for example, an observer situated near the origin 
of the frame S, viewing a distant source of light situated at a 
point P, such that the direction cosines of OP arel, m,n. The 
part of the wave train which reaches this observer is practically 
a plane wave with direction of propagation, —/, —m, — n, 
and the light vector is a harmonically varying magnitude which 
varies as the expression 


sin p(t + lx + my + nz) 


where # is the pulsance (27 X frequency) of the light, assumed 
to be monochromatic ; the frequency is, of course, expressed 
in vibrations per 1/c second. Now consider the frame S’ and 
an observer fixed in it, also near its origin, and let us elicit how 
he would measure the same element of the wave train.* We 
can do so by substituting for x, y, z, ¢ their values in terms of 
x’, y’, 2’, t’. This shows us that the S’ observer finds the light 
vector varying as the harmonic function 


sin p[a(t’ — w’x’) + la(x’ — u't’) + my’ + nz’), 
Te: sin p’(t’ + Ux’ + m'y’ + 1’2’) 


where 
p’ = a(t + lu)p 
i = (}+a)/(I + P| (x) 
“Wn = 


m/a(xi + lu) 
nja(i + lu) ~ | 


The last three of these equations embody the well-known 


* We are assuming the observations to be made when the two 


origins are near one another. 
63 
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aberration of light. The frame S is supposed to be fixed with 
respect to the sun and the frame S’ to the earth. For a short 
interval of time such frames could be regarded as in uniform 
relative motion, « being the speed of the earth in its orbit. 
The angular co-ordinates of a distant star, as measured by an 
observer on the earth and an imaginary observer fixed near it 
on the earth’s orbit, would not agree; their cosines would be 
related as indicated in equations (1). In the particular case 
when the star is situated, say, on the axis OZ, so that in the 
frame S the light rays are perpendicular to the direction of 
motion of S’ relative to S, the result becomes very simple. 
Since 1 = m = 0 and ” = I, we have 


Poesy it = ON == Ela. 


So the light makes an angle @’ with Z’O’ in the frame S$’, 
such that 
tan 6’ = au. 


Classical theory yields in this case the result that the angle 
of aberration is arc tan wu. As u is of the order I/10,000, and 
therefore a about I — 10~§8, astronomical observations are 
incapable of deciding between the two results. 

But there is one difficulty in aberration theory which Rela- 
tivity surmounts readily—very naturally so, since its object 
was to give a comprehensive explanation of this and similar 
difficulties. As long ago as the middle of the eighteenth century 
Boscovich had suggested the use of a telescope filled with water 
in aberration experiments. His suggestion was never carried 
out until Airy in 1871 made some observations on the star y 
Draconis with such a telescope, at the Greenwich observatory. 
But in the interval of a century there had been a very searching 
discussion of the possible results of such an experiment. Some 
physicists believed that owing to the slower speed of the light 
through the water, the angle of aberration as measured by 
such a telescope would be larger than that measured by an 
ordinary telescope whose tube contained air, the relation 
between the two measures depending on the index of refraction 
for air and water. Fresnel, however, in endeavouring to dis- 
cover an explanation for an experiment performed by Arago, 
was led to propound a hypothesis concerning what is called in 
theories of the ether the ‘“‘ dragging coefficient of matter on 
ether,’ and on applying this hypothesis to Boscovich’s hypo- 
thetical experiment, predicted that the material filling the 
telescope would not affect the angle of aberration. This was 
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in 1818. But although in the meanwhile Fizeau had (in 1851) 
tested, with quite satisfactory results, Fresnel’s hypothesis by 
another experiment, to which we shall refer presently, it was 
not until 1871 that his prediction concerning the aberration as 
measured by a water-filled telescope was verified by the work 
of Sir G. Airy. Of course, this independence of the angle of 
aberration and material filling the telescope tube is just what 
would be expected on the relativity view. The direction 
cosines of the light rays in the S’ frame are — 1’, — m’, — n’, 
and are determined entirely by the cosines in the S frame and 
the relative velocity of S’ and S. If an observer wishes to focus 
an image of the star on the cross-wires of any symmetrical 
optical instrument, he naturally will lay the axis of the instru- 
ment along /’, m’, n’, no matter what transparent media fill the 
space between the lenses or mirrors of the instrument. 

Returning to equations (I), we deduce that the frequency 
ot the light in the S’ frame is not the same as that in the S. 
This is the familiar D6éppler principle, but with some variations 
from the customary result. Let us take two special cases so 
as to elucidate the difference. 

If the light is being propagated opposite to the relative 
motion of S’ to S, then] =randm=n=o0. 

Hence 


pi = oft + up = pla + w(x — wh. 


If the propagation is in the same direction as the relative 

motion of S’ toS,/ = — 1, m= = 0, and 
p =a(r — up = pir — w)t/(x + wh. 

Now classical ether theory yields somewhat different results. 
Thus it would discriminate between the case where the source 
is stationary in the ether and the observer moving towards it 
with a velocity u, and the case where the observer is at rest 
and the source approaching him, in the first case giving £(I + 4) 
as the altered frequency, and #/(1 — u) in the second. Rela- 
tivity theory makes no such discrimination, and gives a value 
p(x + u)t/(x — uv)? intermediate to the previous two. As the 
three results only differ in squares and higher powers of u, 
observation is not sufficiently refined to give preference to one 
more than another. When the observer and source are receding 
from one another, the corresponding values are 


p(t — 4), P(r +), P(x — wa/(x + wt. 


_If we also consider a beam of light propagated at right 
D 
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angles to the direction of relative motion, so that / = 0, then 


p = op, 

or there would be altered frequency, the difference being of the 
second order in the ratio of the velocities of relative motion 
and light. As far as the author is aware, no verification of this 
result has been attempted, although it has been suggested that 
with great care a definite result might be obtained by analysis 
of the light emitted by the positive rays in directions perpen- 
dicular to the cathode channel. 

The third effect coming under review is the so-called 
“dragging effect”? of moving transparent media on light, 
referred to above. 

The propagation of light in a material medium, which 
is itself in motion relative to the source and the observer’s 
instruments, is a problem which has evoked a great deal of 
discussion and controversy even in the days of the elastic-solid 
theory of the ether. When optical theory became merged in 
general electromagnetic theory, the matter became still more 
involved; and, indeed, it was the labours of Lorentz and 
Larmor to evolve order out of this mass of uncorrelated material 
that led directly to the position in which the formulation of an 
exact relativity principle became possible. When the concept 
of the ether as a medium of propagation was ousting the cor- 
puscular theory from the position of privilege which it had 
occupied in works on natural philosophy, there were two 
extreme hypotheses regarding propagation in moving matter 
which were equally plausible. One was that the motion of the 
matter produced no effect on the propagation of the light 
through the ether, i.e., to choose a definite case, if the velocity 
of light in the ether is c cms. per sec. im vacuo, and therefore 
c/w cms. per sec. in a transparent body at rest in the ether 
(u being the refractive index), then the velocity relative to the 
ether is still c/w cms. per sec., even if the body is moving through 
the ether, and so is c/w — v, or c/w + v, cms. per sec. relative 
to the body, according as the motion of the body is in the same 
or in the opposite direction to that of the light. The other 
extreme view regarded the light as travelling with the same 
velocity with regard to the body, c/u, as when at rest; but 
with a velocity c/u + v, or c/w — v, with regard to the ether. 
On the former view the body was considered to have no 
“ dragging effect ’’ on the ether, while on the latter it was 
conceived to have a “ coefficient of convection ” equal to unity. 
Owing to the special views which Fresnel entertained concerning 
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the elasticity and density of the ether im vacuo and in matter, 
he was led to propound a view intermediate to these extremes. 
He proposed a “ convection-coefficient ’’ equal to 1 — r/p?, 
ie., a velocity relative to ether of c/u + v(I — 1/2), and to 
the body of c/u — v/u*, where the body and light move in the 
same direction; by means of this formula he succeeded in 
accounting for an experimental result which had just been 
communicated to him by Arago, and at the same time predicted 
the independence of aberration to the material filling the 
telescope, a prophecy verified fifty years later by Airy, as 
recounted. In 1851 Fizeau made what seems, at the first 
glance, to be a direct verification of Fresnel’s hypothesis, but is 
really an illustration of a purely relative phenomenon with no 
direct bearing on the ether question at al]. What Fizeau 
recorded (and his work was repeated with further refinements 
in 1886 by Michelson and Morley, with quantitative results 
much more precise) was that light travelling in flowing water 
had, relative to the pipe in which the water flowed, and to the 
observer’s apparatus stationed at one end of the pipe, a velocity 
neither c/u nor c/w + v, but a velocity c/u + xv, where « is 
a fraction less than unity. The comparison of « with 1 — 1/y?, 
effected by the work of the two American physicists, was very 
satisfactory. On account of the agreement of Fresnel’s con- 
vection-coefficient with experiment, it became in later years a 
desideratum for any form of electromagnetic theory to deduce 
this coefficient, at all events, to an accuracy of the order of v/c, 
if it expected to command attention. Indeed, so long as effects 
of the first order only were in question, this simple result of 
Fresnel’s was quite sufficient to account for the failure of 
velocities relative to the ether to condescend so far as to allow 
themselves to be measured. It was only when Michelson and 
Morley’s “ ether-drift’’ experiment showed that this lack of 
condescension extended even to second-order effects (depending 
on v2/c?), that Fresnel’s coefficient became inadequate as an 
excuse for the ether’s bashfulness. To those interested in the 
historical development of pre-relativity views, accounts are 
obtainable in standard English text-books on Optics, or in 
Whittaker’s ‘‘ History of Theories of the Aether.” It is interest- 
ing to note that the theory of a mobile ether, which played an 
important part at one time and was developed by Stokes, but 
was abandoned by later physicists, has been recently reintro- 
duced into discussion (with no great conviction on the author’s 
part apparently) by Dr. Silberstein. Ina paper to the February 
“Phil. Mag.” of 1920, he points out that recent results on the 
5 * 
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velocity of light near gravitating matter gives some plausibility 
to an hypothesis concerning condensation of the ether near 
celestial bodies, which Planck suggested some years ago as a 
necessary addendum to Stokes’ theory. To those who wish to 
consider ‘“‘a last glimpse of hope for the banished medium,” 
a perusal of this paper will prove interesting and helpful. 

To return, however, to Fizeau’s experiment, the reader will 
see that in it (as in all experimental work on this vexed question 
yielding so-called positive results) there is no direct question 
of ethereal velocity involved at all. Here, as always, measured 
velocities are velocities relative to matter—in this case, the 
water and the pipe. The treatment of this problem on the 
relativity basis is an excellent illustration of Einstein kine- 
matics. The water is at rest in a frame S’, and the velocity of 
light through it is 1/u (the velocity 7m vacuo being unity) ; 
the frame S’ is in motion with the velocity u along OX, relative 
toS. The light being propagated in S’ along O’X’, its velocity 
relative to S, which is supposed to hold the pipe and observer’s 
apparatus, is obtained from equation (2B) of this chapter by 
putting v, equal to wu, and v, equal to I/u. The result is 


(w+ I/p)/(I + u/p). 


If we neglect squares and higher powers of u, this is approxi- 


mately equal to 
(u + I/u)(I — u/yu) 
= I/u + “(I — I/p*), 


which is Fresnel’s result. 

This note would be incomplete, however, if it failed to 
emphasise once more the totally different standpoints of Rela- 
tivity and earlier theory. On the basis of a fixed ether, physi- 
cists deduced that certain optical effects depending on relative 
motion of matter and ether should manifest themselves. 
Failure on the part of these to exhibit themselves led to the 
introduction of additional hypotheses such as that of a convec- 
tion-coefficient for first-order phenomena, and a contraction- 
coefficient for second-order, which produced agreement between 
observation and theory, so long as observation could not be 
pushed beyond a certain degree of precision—a not altogether 
satisfactory state of affairs. Relativity accepts the absence 
of these expected phenomena as an exact and fundamental 
natural law, and not as the result of a peculiar conspiracy on 
the part of matter to hide the blushing ether from our view. 
It treats the matter in its own particular and mathematically 


RESTRICTED RELATIVITY 69 


simple way, and finds formule which naturally imply these 
negative results, and which the thorough-going relativist 
accepts as exact and verifiable by any experiment conceivable, 
however precise. 

There is a celebrated experiment carried out by Sagnac 
which should perhaps be mentioned here, although it has no 
real bearing on the restricted Relativity which we are now 
discussing.* It used, however, to be referred to as in contra- 
diction to the hypothesis of Relativity. A pencil of light 
issuing from a source is divided in two by the usual devices and 
one-half is made to travel round the perimeter of a circular disc 
in one sense by means of mirrors suitably placed and tangential 
to the circumference, the other half being sent the other way. 
The two partial beams, after their respective journeys, meet 
once more and interfere to produce a set of fringes. The disc 
is now rotated, and there is a displacement of the fringes. 

Now as the disc had a radius of 25 cms., and was turned 
only at the rate of two revolutions per second or thereabouts, 
it should be obvious even to a beginner that the second-order 
effects, with which Relativity is primarily concerned in such 
experiments, have no place here at all. The result is purely a 
first-order effect, and the reader will find an elegant treatment 
of it as such in the “ Comptes Rendus ” (Nov., 1921), by Lan-’ 
gevin. But, in any case, the restricted Relativity principle con- 
cerns itself alone with uniform motion; it has nothing to say 
on the relativity of rotation and acceleration. That will come 
up in the second part of this volume, and be discussed there. 


eG. 157 (x9r3), pp. 708 and’ 1410, “J. de Phys.” (5) 4, 
(1914), Pp. 177. 


CHAPTER III. 


So far we have been concerned with the relations which exist 
between space and time co-ordinates in two frames, S and S’, 
in uniform relative motion to one another, and between kine- 
matic measures in these frames. For this purpose we have 
only required the so-called second postulate of Relativity, the 
invariance of the velocity of light. We now come to considera- 
tions where the first postulate makes a direct appearance, the 
invariance of the expressions of the laws of nature. 

We are, in this connection, concerned not with verbal state- 
ments of these Jaws so much as with the equations which are 
the mathematical expression of these laws. All science of an 
exact nature professes to deal with measured quantities, and 
to find uniformities and sequences in these measures. In 
physics the main object in studying any natural phenomenon 
is to find the manner in which a measured quantity in one place 
and at one time is related to its measures at other places and 
times. To this end any theory which professes to be true must 
lead to differential equations, the solution of which would 
presumably determine such relations and determine them 
correctly, i.e., in such a way that they would agree within the 
limits of experimenta] error with all observations in whose 
precision we have the necessary confidence. We then say that 
a law is “ true to nature.” But if we accept the standpoint of 
Relativity, we subject such laws to a second test. In the 
differential equations, the variables with regard to which we 
differentiate the symbols which represent the measurable 
quantities are the space and time co-ordinates. At once the 
question of a frame of reference arises, and in relativity no 
privilege is given to one frame above another. The form of 
the differential equations must be the same in S as in S’; a 
solution would yield a function of x, y, z, ¢t, which would fit 
with the measures of the quantity in S, and exactly the same 
function of x’, y’, 2’, ¢’ should fit with the measures in S’. 

It has been pointed out in the Introduction that this test of 
Relativity would be satisfied by the Newtonian laws of motion 
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if the relations between the co-ordinates and time in the two 
frames were embodied in the equations 


But, as was shown, these equations are inconsistent with an 
equal velocity of light in the two frames ; moreover they do not 
allow the very important and fundamental equations of electro- 
magnetic theory to assume the same form in the frame S as they 
do in S’ without introducing, as Lorentz did, hypotheses con- 
cerning contractions accompanying motion through the ether, 
and such concepts as “ effective co-ordinates ”’ and “ local time.”’ 
On the other hand, if one uses the Lorentz transformation, the 
electromagnetic equations pass the relativity test, but the 
Newtonian equations of motion do not. This, at once, raises 
in an acute form the question of the universal validity of 
Newton’s laws. To some minds such a challenge seems almost 
blasphemous. In the early days of Maxwell’s theory of the 
electromagnetic field, it would not have been considered for a 
moment as a possibility; but from the year 1881, when 
J. J. Thomson first introduced the idea of the ‘“ electromagnetic 
inertia’ of a moving charge, theory and practice have com- 
bined gradually to accustom men’s minds, not to an abandon- 
ment, but to a revision and generalisation of the Newtonian 
laws. In particular, it was perceived that the experiments on 
high-speed electrons compel physicists to relinquish the notion 
of invariable mass. It is true that the added electromagnetic 
mass can be associated, if one likes, with that dragging along 
of- Faraday tubes, of which the magnetic field of a moving 
charge is conceived to be the outward and visible manifestation. 
Nevertheless, an attitude of scepticism was more and more 
observable, and the relativist was really rendering a signal 
service in bringing the matter to a head, and in offering for 
consideration a set of equations to which Newton’s are an 
approximation in certain limiting conditions, which satisfy the 
relativity test, and which are true to nature under conditions 
for which Newton naturally had no experimental data to go 
upon. ' : 

: In the Newtonian scheme, the differential equations for a 
body are derived from the statement that the force is propor- 
tional to and codirectional with, the rate of charge of momentum, 
which leads to the differential equations 
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d(mdx/dt) /dt = F,, 
and two similar equations ; 
or max/di? = F,, 


and two similar equations. 

An essential feature of the Newtonian system is the absolute 
constancy and invariability of m for the body, so that m has 
the same value for all positions and speeds of the body. A 
further feature is the nature of the quantity represented by 
the symbol appearing on the right-hand side. A great deal of 
discussion has raged around the nature of force and the possi- 
bility of action at a distance as contrasted with transmission 
of stress through a continuous medium, but for our immediate 
purpose the symbol F stands for a quantity depending on the 
relative position of the body to other bodies said to be “ exerting 
force”’ on it; that is, it is a function of the co-ordinates of the 
other bodies relative to a definite point in this body as origin. 
At least, that is so in the case of conservative forces. In 
motion involving dissipation of energy as heat, EF would also 
include terms depending on the relative motion of the body 
and the other bodies referred to. 

To sum up, we equate two vector quantities, one involving 
the second differential coefficient of a radius vector with 
respect to time, and the other involving no such differential 
coefficients, or, at most, first differential coefficients ; so that, 
given sufficient information about the position and motion at 
a given instant (the “initial conditions ’’), and the manner in 
which F depends on position and motion, we can evolve by 
solution the manner in which x, y, z depend on ¢, and obtain the 
path of the body, idealised, of course, as a particle concentrated 
at its centre of mass. In fact, ¢ is the independent variable 
upon which x, y, z depend, and the object of the solution is to 
express these latter symbols as functions of ¢. 

As already stated, the possibility of retaining an invariable 
m in the equations as a perfectly general condition has been 
seriously questioned for some time, even before the enunciation 
of the Relativity principle ; and, in any case, they cannot be 
made to square with relativity, if one retains the Lorentz 
transformation equations. For if the above equations are true 
in the frame S, then it is not true inthe frame S’, that 


marx di? =F 0 etc, 


where F’ is the transformed F, the Lorentz equations being 
employed in the transformation. 
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From the standpoint of Relativity, we must endeavour to 
suggest equations of motion which will transform unchanged 
in form; and, of course, as Newton’s equations have passed 
unchallenged so long in conditions where the material velocities 
are small compared to that of light, such suggested equations 
must degenerate towards the Newtonian form as a limit as the 
velocity approaches zero. Attention should be carefully 
focussed on this procedure ; it is typical in Relativity theory. 
Relativity does not deny the possible validity of well-attested 
laws as extremely good approximations in conditions of com- 
paratively slow motion. It gladly welcomes such knowledge as 
a very necessary, and, indeed, indispensable, help towards the 
discovery of more general laws which will accord with the 
principle, and which (it is hoped) will turn out to be true to 
nature under circumstances not hitherto contemplated. The 
problem is, in fact, twofold: in the first place, existing know- 
ledge has to be generalised, accordance with Relativity being 
a test to which all suggested generalisations must conform ; 
in the second, experimental work must be carried out where 
possible on the validity of the suggestions under broader con- 
ditions. We can illustrate the procedure admirably in the 
present instance. 

In the first case, it has been noticed that in the Newtonian 
equations, ¢ is the independent variable. Now, it is a feature of 
Relativity that the time measured in a given frame, and the 
space co-ordinates in that frame, are much more closely linked 
than before. So close is the union, that there is, as stated, a 
strong impulse in certain quarters to regard time and space as 
a unity which we split in two for our own immediate convenience, 
but with no philosophical justification. Be that as it may, it 
is no longer possible in the differential equations conforming 
‘with Relativity to regard ¢ as an independent variable ; it must 
be put on the same footing as the space co-ordinates 4%, y, 2, 
and regarded as a dependent variable. But dependent on 
what? It is here that we begin to perceive the extreme 
importance of the concept of proper time introduced in the 
previous chapter. As pointed out there, an interval of proper 
- time measured from a definite point-instant to another definite 
point-instant is the same in all frames; it is invariant. It 
retains, therefore, sufficient of the absolute nature that was 
formerly postulated for time, measured anywhere, to render it 
suitable as the independent variable in our differential equa- 
tions. Further, in using the convention of splitting vector 
displacement into three Cartesian components, we have hitherto 
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employed groups of three equations. Now, we must be pre- 
pared for groups of four, since differential equations must 
contain the coefficients of x, y, z and ¢ with respect to Tr. 
Let us try 
d(udx/dt)/dt = P, 
d(wdy/dr)/dt = Py (x) 
d(udz/dt)/dt = P| : ; ; 
d(udt/dt)/dr = P, 


Concerning these, a few remarks are necessary before 
proceeding. 

i. We must endeavour to keep out of mind at the moment 
terms like mass and force as used in the Newtonian theory. 
So as to give no unconscious bias in that direction, the symbols 
m and F are not employed at this stage. The terms will 
be reintroduced later, but their definition as used in relativity 
arises directly out of the suggested equations, and they are 
not regarded as entities having a meaning apart from the 
equations. 

2. There are four equations. 

3. The proper time 7 is the independent invariable. 

4. » is regarded as an invariant quantity for the particle 
whose path in a given frame is implicit in these equations. It 
may differ for different particles. But for a given particle, it 
is at a given point-instant a quantity measured alike by all 
observers ; if it is » in S, p’ in S’, p” in S”, and so on, then 
i —p =p = ete., so that the use of accents in connectiog 
with it is quite unnecessary. This assumption corresponds to 
the assumption of constant mass in Newtonian dynamics, but 
must, on no account, be confused with it. It is a much more 
general assumption, as will be seen later, and for that reason 
we shall endeavour to keep the issue clear by giving it no name 
at present, least of all ‘‘ mass.’’ Further, it may be as well to 
point out that “invariance”? must not be confused with 
“invariability ” or “ constancy.’’ At a given point-instant for 
the particle, all observers use the same value of » ; that assumes 
invariance. At another given point-instant all observers will 
again use the same value of ~—invariance once more; but 
there is no assumption at present that the first » and the second 
p shall be equal—that would be assuming constancy or invari- 
ability. To put it another way, it is quite possible that p 
might be a function of x, y, z, t—at least, that possibility is not 
ruled out in the assumption of invariance. If the particle is 
at the point-instant whose co-ordinates are (x, y, z, #) in S, and 
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(x’, y’, 2’, t’) in S’ (connected by the Lorentz equations), then 
w(%, Y, 2, t) = p(x’, y’, 2’, ). 
But if (%1, 1, 21, t;), and (%2, 9, 2g, #,), are the space-time co- 


ordinates of two point-instants of the particle in one frame S, 
then there is no assumption so far that 


MX Vay 2 b1) = W(%2, Ya Ze, te). 
The absence of any assumption as to the constancy of p is 
implicit in the form we have given the left-hand side of the 
equations (I), where d(udx/dr)/dr is written, and not pd?x/d7?. 

5. Px, P,, P;, P; are assumed to be functions of the co- 
ordinates and velocity of the particle; but again, we refrain 
from introducing the term “ force”’ at this stage. 

6. The first three equations are clearly sufficiently similar 
in form to the Newtonian triad to pass for a formal generalisa- 
tion of them, into which we can reasonably hope to read a 
definite meaning (true or otherwise), but the fourth looks 
decidedly peculiar at a first glance. It will turn out, however, 
to be of first-rate importance, an old friend in disguise, in fact ; 
but for the moment we shall fix our attention on the first three. 

The first thing to satisfy ourselves about is the test of 
transformation. If the equations do not satisfy the first 
postulate of Relativity, they are of no interest to us as relativists. 

In considering the transformation from S to S’, we have 
by equations (4) and (5) of the previous chapter (remembering 
the invariance of p), 


d(pdx' |dr) [dr = a(P, — uP;) 
d(udy' |/dr) [dr = 

d(udz' |dr) [dr = P. 

d(padt’ |dr) [dr = a(P; — uP,). 


Hence for the invariance of the equations it is necessary 
that the measures made in S’ of the quantities involved in the 
right-hand side of the equations (1), viz., P,’, Py’, P,’, Pz’, be 
related to the measures in S by the equations 


BigP = P,) 
Py Pee; 
ieee = a(P; ae UP ,) 
and of necessity also by the equations © 
Ps — OE en wu’ P;') 


a ey P,=P, 
P; = a(P;’ = WP‘) 
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In short, the quantities P,, P,, P,, P; must be cogredient 
with x, y, 2, ¢. This is a condition we shall have to bear in 
mind and return to when we elicit, as we shall do shortly, some 
more definite information as to the nature of these quantities. 

Passing on to the question of “truth to nature,” we first 
find the form to which these equations degenerate if the 
velocity of the particle in a given frame S is taken to be com- 
paratively small, i.e., if v is a vanishingly small fraction. 

Remembering that d/dr = Bd/dt, we see that 


d(updx[d)idt= PB . . . (3) 


and two similar equations. Now as v approaches the value 
zero, B approaches unity, and dB/dt (which = d(1 — v?)~#/dt = 
B®vdv/dt) approaches zero. 

Hence the equations (3) degenerate to the form 


d(udx/dt)|dt = P, 


and two similar equations, and these are the Newtonian 
equations provided we interpret pw as the “mass” of the 
particle, and P as the “force exerted on it.” So our 
suggested equations satisfy Relativity if equations (2) are 
satisfied, and they contain the Newtonian equations as a 
limiting form. It remains to see if they are in agreement 
with any known experimental results concerning bodies travel- 
ling with comparatively high speeds. Recent observations on 
the deviation of electrons by electric and magnetic fields are 
decisively in favour of the view that if the force acting on the. 
electron is calculated in terms of the electric and magnetic 
intensities of the applied fields in the manner suggested by 
electromagnetic theory, then the measure of the electron’s 
inertia (i.e., the factor which multiplies the velocity in the 
calculation of momentum) is not a constant, but increases very 
markedly as the velocity increases, and the quantitative results 
agree, even up to speeds as high as -8 of the speed of light, with 
the assumption that the inertia varies inversely as (I — v2), 
v being the ratio of the electron’s speed to that of light. 

Now it is clear that this conclusion is contained in equations 
(3), provided we interpret the expressions P,/B, P,/B, P,/B as 
the “ force.”’ Suppose, then, using the symbols F and F,, F vale 
for “ force” and its components, we rewrite (3) as 


d(mdx/di)/dt =F, . : » (3A) 
and two similar equations where 
m= wB = p/(r — v?)A, 
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We may refer to m as the “‘ mass ”’ of the particle, but there 
is no constancy about it. Still, there is nothing in these 
equations contrary to any known facts. Assumption of a 
constant » will cover a]l known results. It is customary to 
refer to this quantity as the “‘ rest-mass,” for it is the limit of 
the ‘“‘ mass’ as v approaches zero.* What is most striking is 
the ease with which the Relativity principle, in the character of 
a test to which equations of motion must conform, arrives at 
a result as regards inertia im general, which had already been 
worked out from general electromagnetic theory by Lorentz 
(with the aid of special hypotheses) for the electron, and which, 
as stated, has been amply verified by experience. It must be 
borne in mind that we are not dealing here with mass as a 
“quantity of matter’ or any other vague concept. The word 
is used in the sense of “inertial mass,” i.e., a factor in 
“momentum,” which, in its turn, is the quantity whose rate 
of change per unit of time measured in a given frame is equal 
to the “ force,’ which is another quantity supposed to be 
calculable by definite and experimentally-justifiable rules. 
With many of these rules we are all familiar, e.g., the well-known 
inverse square laws for purely gravitational, electrostatic, and 
magnetostatic observations. But in this connection a word of 
warning must be uttered. A few paragraphs back, attention 
was drawn to the fact that the quantities P,, Py, P,, P; must be 
cogredient with x, y, z, ¢ if our suggested equations of motion 
are to lie within the bounds of the Relativity hypothesis. Con- 
sequently, whatever be the rules by which we calculate forces, 
it follows that BF,, BFy, BF,, and P; must be cogredient with 
the space-time co-ordinates. ‘‘ But,’’ one may naturally ask, 
“what is P;?”’ In that thorough-going form of the analysis 
employed in Relativity problems, which makes full use of the 
idea of a four-dimensional space-time, one speaks of time 
component with the same justification as space-components. 
But at this juncture we had better avail ourselves of more 
familiar conceptions. If we revert to equation (3) of the 
second chapter, we see that 


: 2 — x2 — y? — 72 — 1, 
and hence ti — xX — yy — 2 = 0 
or t= Kv_ + Woy + 20, ; , a A) 


which is really only another way of writing equation (6) of 
that chapter. 


* “ Proper ’’? mass may also be used. 
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So if we assume that p is a constant for a given particle as 
well as an invariant, it follows from (4) that 


d(pi) dr = X0,d(u%) |dr, 
and hence by (r1) 
P= > (v,P x) 
= B&(v,F x) 
=fv.F),. . (5) 


using the usual vector symbol for the geometric product of 
velocity and force. As a consequence of this result, it follows 
that in order that the suggested equations of motion may 
survive the relativity test, any suggested Jaw of force must 
permit the quantities 


BE’, BF y, BF, Biv : F) 
to be cogredient with the space-time co-ordinates. In short, 
if the law when employed by observers in S gives F,, Fy, F, as 
the components of the force on a particle at a given point- 
instant, and the same law as employed by observers in S’ gives 
F,’, Fy’, F,/ as the components at the same point-instant, then 


BF,’ = a[ BP, — uBiv . F)] 
Bio Ole a ai. : one) 
Biv’. BY) = af Bv . F) — uBF,] 

It will appear later that when subjected to this test, the 
law for calculating the force on a charged body moving in 
an electromagnetic field is satisfactory ; but the well-known 
Newtonian inverse square law for gravitational force is not. 
But these matters will be deferred to the next chapter. We 
will return at the moment to the discussion of the broadened 
conception of mass, and we now find ourselves in a position to 
interpret the fourth member of equations (1), whose peculiarity 
we noted at the outset. We can now write it 


d(uB)/dt = P/B 
or dm/dt = (v : F) 5 : 4 6 (7) 


It was stated that this equation would turn out to be an 
old friend in disguise. It is, in fact, the equation of energy. 
For multiplying each side of (7) by an element of time 8¢ (for 
the frame S), we get 

6m = XF,0x. 


The right-hand side is, if interpreted on the usual lines, the 
work of the force on the particle while it is displaced along an 
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element of path whose components are (8x, Sy, 52). Consequently 
dm is the increment of the particle’s kinetic energy during that 
displacement, and thus the kinetic energy of the particle must 
be the integral of dm from the condition of rest. But at rest 
the value of m is ~. Hence the kinetic energy of the particle, 
when its velocity is v, is equal to m — p or p(B — 1). 

As one may reasonably expect, this extremely important 
result degenerates to the usual expression for small velocities, 


for 
w(B — I) = pl[(r — v*)-# — 1] 
= plgu? + gut + .. .], 


which approaches 4yv? as v approaches zero. But, of course, 
the really striking feature of this result is the establishment of 
the identity of kinetic energy with the increase of mass due to 
the motion. It requires some mental effort on the part of 
those unacquainted with recent electromagnetic theory to take 
to such a notion kindly. But at the outset the warning was 
uttered that we were to disabuse our minds of all preconceived 
notions of mass, momentum, and force; that these words were 
to be accepted as convenient names for certain terms in the 
suggested equations of motion, which formally make their 
appearance in the same character as do terms in the Newtonian 
equations bearing the same names. To those, however, who 
know some electromagnetic theory, this apparent identity of 
mass and energy will not come as a shock. As has been men- 
tioned, almost forty years ago J. J. Thomson showed that a 
charged body has its inertial mass increased by the fact of its 
electrification, and the amount of this increase is proportional 
to e?/a where e is the charge and a a linear dimension of the 
body (its radius in the case of a sphere). He further showed 
that this ‘‘ electromagnetic mass’ increased with the speed of 
the charged body. Later it was found that the assumption 
that the whole mass of an electron could be accounted for in 
this electromagnetic manner was perfectly justified by experi- 
ment; and, finally, Lorentz showed by a special hypothesis 
concerning the deformability of the electron, that the “ rest- 
mass ”’ of an electron was proportional to e?/a, and the mass at 
speed v was proportional to fe?/a, the speed being interpreted 
as speed through the ether. . Now the point to be emphasised 
in this connection is that e?/a has the dimensions of energy in 
electrical theory. Therefore, so far from arriving at a non- 
sensical result, Relativity has deduced as a general proposition 
a conclusion which had already been suggested by a special 


80 RELATIVITY 


branch of physics in a special case, with the aid of certain 
ad hoc assumptions. Moreover, if we are to identify energy of 
motion with increase of mass due to motion, it is but a natural 
process to identify the entire mass of a body as a measure of 
energy also, so that the ‘“‘ rest-mass”’ would be but another 
name for an amount of energy “‘intrinsic”’ to the body, and 
existing in it, independent of its motion. There is nothing to 
oppose such an assumption, and a good deal to support it. 
We have seen it justified in the case of the electron. As further 
evidence, we have only to mention the well-known fact that 
most theories as to atomic structure which are being propounded 
and developed to-day, assign practically the entire mass of an 
atom to its positive nucleus, and do so by postulating an ex- 
tremely small nucleus, so that the electromagnetic measure 
of the mass may produce a sufficiently large result by reason 
of the small linear quantity occurring in the denominator 
of the expression e?/a. There is still a more striking illus- 
tration of the assent which is being given to the belief that 
energy and mass are identical. In attempting to account 
for the periodic properties of elements, suggestions have been 
put forward that all atomic nuclei are built up of hydrogen 
nuclei or “‘ protons,” with attendant electrons. This would 
apparently imply that all atomic weights could be obtained by 
adding simple multiples of the weight of a proton and of the 
weight of an electron. Now this is not so, but the discrepancies 
between the actual atomic masses and those calculated on this 
theory of atomic structure are accounted for by what is called 
“nuclear packing.” In simple words, it is assumed that if 
these ultimate electrified particles have their mutual distances 
of separation altered so that their ordinary electrostatic potential 
energy is altered, then also is their total inertial mass as a group 
altered, and it is not justifiable to regard this total mass as a 
mere arithmetical sum of the separate masses of the particles 
when isolated, but as a quantity which varies with change of 
internal structure in the group, just as the energy of the group 
varies. It is, indeed, a sign of the power of the Relativity 
principle that it unifies in a direct and simple way so much 
apparently unrelated material in our physical and chemical 
schemes. It is a far cry from the Michelson-Morley experiment 
to the periodic table of the elements, yet Relativity discloses an 
unsuspected relation. 

We have hitherto made no use of a possible variability of p, 
the rest-mass or intrinsic-energy (whichever name we choose to 
give it), although we have hinted at such a possibility. (In this 
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connection it may not be out of place to reiterate the warning 
against confusing “‘ constancy ”’ with “ invariance.”’) But we 
have allowed for it, nevertheless, in our equations by writing 
d(udx/dr)/dr, etc., at the outset, and not pd?x/dr?, etc. Also, 
in (7) we see that a finite dm/dt might be the result of an altera- 
tion in » as well as an alteration in v and 8; so that it would 
be theoretically possible for a force to be accompanied not by 
an acceleration of the body’s motion, but by an alteration 
in its intrinsic energy. To illustrate this possibility, let 
us consider a case which was once proposed as an example 
in which Relativity led to a contradictory result. A body 
is at rest in the frame S, and is radiating energy. Obvi- 
ously, there is no ground for believing that it would begin to 
move in S, if isolated from the action of other bodies. Hence, 
in the frame S’, it should move with a uniform velocity, viz., 
the relative velocity of the frames. But on electromagnetic 
theory such a body would, if in motion, be subject to a resistance 
due to its own radiation, and therefore should be retarded in 
S’. The reconciliation lies in the fact that if we offer to discuss 
this apparent contradiction we must, in order to justify the 
Relativity result, use Relativity mechanics, which does not 
deny the conclusion that there is a “retarding” force 
as obtained by electromagnetic theory, but denies the 
necessity of a retardation on that account. Decrease of 
momentum there must be, but the decrease is due to the 
diminution of the inertia factor and not of the velocity factor ; 
for as the body is radiating, its intrinsic energy grows less, 
i.e., its rest-mass is decreasing. 

We should have to admit in a similar fashion that if a body 
were rising in temperature, it would require a force to maintain 
it in uniform motion in a given frame, on account of its increasing 
intrinsic energy. If any astonished person inquires for the 
“mass of 1 calorie of heat,’’ he will not be perpetrating a joke, 
but asking a perfectly reasonable question, and one to which 
an answer is possible. Let it be recalled what fundamental 
units we are using; they are I centimetre for length, and 1/c 
second for time. Hence our unit of velocity is c cms. per sec., 
and our unit of acceleration is c? cms. per sec. per sec. So 
if we wish to retain the erg as our unit of energy, and the dyne 
as our unit of force, we must use I/c? gram‘as our unit of mass ; 
for a body with that rest-mass, and moving off with an initial 
acceleration equal to our unit of acceleration, would be subject 
to a force of 1 dyne. In brief, just as mass and energy have 
become identified, we find also that I erg is equal to I/c? gram, 
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or, if one dislikes that way of putting it, we can say that 
if the energy of a body, intrinsic or kinetic, increases by c?, 
i.e. 9 X 10” ergs, its inertial mass increases by I gram. | 

Now suppose I calorie of heat is absorbed by a body without 
performance of external work by it, the intrinsic energy has 
increased by 4:2 X 107 ergs, and therefore the body’s mass is 
greater by (4:2 x 107)/(g X 10”), or 4:7 X Io~*4gram. Sucha 
numerical result does not hold out much hope of testing this 
conclusion by experiment, but it is a justifiable deduction from 
Relativity. 

The term, “‘ adiabatic’ motion, is a convenient one to apply 
to motion in which the intrinsic energy or rest-mass of a body 
remains unaltered. 

Questions which cannot be more than referred to here will 
naturally arise in the inquiring mind. We have grown accus- 
tomed since the days of Maxwell and Faraday to conceive of 
electrostatic and magnetic energy as localised in space, “‘ in 
the ether,’’ in contrast to the view of mathematicians of the 
eighteenth and early nineteenth centuries, who concentrated 
their attention on the “ charges ”’ of electricity and magnetism 
in matter. No such diffusion was more than hinted at in the 
case of gravitational energy, but undoubtedly all people 
regarded the energy of strain (a form of potential energy) as 
localised in the bent, stretched or twisted material. As a 
result, anyone may now justly ask if we are to diffuse the 
mass accompanying potential energy in precisely the same way. 
The answer must be in the affirmative. Indeed, we have 
already as common features in our text-books of advanced 
physics such phrases as radiation-pressure, the electromag- 
netic mass and momentum of a finite beam of radiant energy, 
etc. What the Relativity theory does is to make such a diffusion 
a natural feature of its mechanics from the beginning, not a 
conclusion which is reached from advanced considerations in 
a special branch of physics. Ifa body is decreasing in speed, 
it is losing energy and mass. The one loss cannot take place 
without the other. This energy and mass may presently appear 
in other bodies ; both must appear together (not merely energy 
alone), but in the meanwhile, since nothing is propagated with 
a speed greater than that of light, they must have been localised 
in a definite volume between the bodies, i.e., as we Say sig 
space,” at each definite instant. There is no escaping the 
conclusion. No doubt it is going to complicate ‘‘ Advanced 
Dynamics ”’ for our honours students in the future! For one 
thing, it will not allow us to localise all mass in a physical 
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system quite so definitely as before. The mass of a solid or 
liquid body always seems a very real and tangible thing of 
which we have clear muscular perception ; the mass of a gaseous 
substance is only too evident in a gale, and even without that 
there is the balance to show us the difference between a flask 
before and after evacuation; but there is something very 
shadowy about the mass resident ‘‘in space.”’ It is true that 
in practice the amount within any finite volume is in general 
excessively minute compared with that possessed by an equal- 
sized piece of “‘ matter,’ but the fact remains that ‘‘ matter ”’ 
is not going to be quite so easily defined as formerly. If we 
have to ascribe mass to the somewhat ghostly haunts of energy 
designated by the phrases, “‘ the gravitational’ and “ electro- 
magnetic field,” the distinction between energy and matter 
becomes one of mere practical convenience. 

But further discussion on these topics is out of place here. 
For those who can follow the mathematics, the points raised 
will be treated by suitable analysis at a later stage, and it will 
be shown that we still preserve the conservation laws in Rela- 
tivity. Conservation of mass and conservation of energy, of 
course, merge into one law; but both it and conservation of 
momentum require us to diffuse mass and momentum in the 
manner indicated, a conceptual process quite foreign to New- 
tonian mechanics. 

Far more important for us at this point is the realisation 
that mass and force are relative concepts. Equations (6) show 
us that with any laws of force measurement which can be made 
to accord with Relativity, a force will have a different value in 
one frame of reference to that in another. Equations (3A) make 
it clear that the mass of a body varies for different observers, 
actually reaching the value infinity for an observer if it could 
attain the velocity of light with respect to him. This is the 
dynamical reason for the view that the speed of a body cannot 
surpass that of light. The material of this chapter can be 
readily summarised in the statement that we have added two 
more tetrads to those groups of four physical quantities 
which we say are cogredient with the space-time co-ordinates. 
They are the “ force-activity ” tetrad, 

BE Bly, plice PUY. «mys 
and the ‘‘ momentum-energy ”’ tetrad, , 
Mx, My, Mz, m,* 
* That these four are cogredient follows from the cogrediency of 


Bux, Bvy, Bv;, 8 demonstrated in Chapter II., and the postulated in- 


variance of pu. 
ee 
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and the rates of variation of the latter with respect to proper 
time are individually equal to the former. 

It is necessary to be on one’s guard against a not unnatural 
misinterpretation of the Relativity laws of motion. In New- 
tonian mechanics, rate of charge of momentum and product of 
mass and acceleration are in general interchangeable expres- 
sions, and either can be equated to the force. That is due to 
the assumed constancy of the mass. It is quite otherwise in 
the new mechanics, e.g., 


F, = d(mv,)/dt 
= mdv,/dt + vydm|dt 
= ma, + B?mvv,dv/dt ; ' Pe ies)) 
Similarly, Fy = may, + B*mvo,do/dt 


F, = ma, + B2mvo,do/dt 


Hence we see that product of mass and acceleration is only 
one part of the force exerted on a body, that in addition to a 
component of force equal to the mass-acceleration product, 
and in the direction of acceleration, there is a further com- 
ponent in the direction of the velocity, and equal to 

B?mv*dv/dt. 
This latter component vanishes for constant speed (magnitude 
of velocity). This is the case when the acceleration is normal 
to the velocity in direction. In this case the equations reduce 
to the form 

ee 


and two similar equations, the force being also normal to the 
path of the particle. Another special case arises when the di- 
rection of motion is unchanged. Let this direction be the axis 
of x; then, dropping suffixes for the moment as unnecessary, 


F = ma + Bmv?a 
ma|(I — v?) 
pal(r — v?)}, 

Thus, in the special case of a “‘ transversal’’ force, the 
quotient of force by acceleration is the quantity m or p/(x as v?)s 
and in the case of a “longitudinal” force the quotient is 
m/(i -— v) or p/(xX — v*)?. On this account these quantities are 
sometimes called the “transversal”? and “ longitudinal ” 
masses respectively. It should be remarked, in passing, that 
the experiments on the increased mass of high-speed electrons 
have all been of the “ transversal” type. Since 


vdu/dt = Yv,dv,/dt = Sv,az, 
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it is not difficult to show that equations (8) reduce to 


Fy = Biuf{(i — vy? — v,2)ay + vyVyay + v0.0; 
Fy = Bu{vyvyax + (1 — U7 — V,7)dy + VydAz}+ ~ (9) 
F, = Bu{v,v,a, + v,v,a, + (I — 0,2 — 0,2)a;}| 
These rather complicated looking equations can be shown 
to embody the result of blending the two special cases referred 
to. For, suppose at a given instant we choose the axis of x to 
be the tangent to the particle’s path, the axis of y to be its 
principal normal, and the axis of z to be normal to its osculating 
plane, then 
Re 970 


and ih = @. 


Z 


Hence, writing v, a; and a,, for v,, a, and ay, respectively, 

we obtain 

F; = Pena; (tangential components) 

Fy, = feu(i — v)a, (normal components) + (20) 

= Puan } 

and the remaining component of force is zero. Hence, if we 
resolve along tangent and principal normal, the tangential or 
“longitudinal ’’ component of force is equal to the product of 
the corresponding component of acceleration and the “ longi- 
tudinal ’’ mass, while the normal or transversal component of 
force involves the “ transversal’’ mass as the factor of the 
normal acceleration. This ambiguity about mass, however, 
only arises in connection with the mass-acceleration rule for 
force ; there is no doubt about mass when we use the form of 
the law involving rate of change of momentum, and, as a 
general rule, it is in that sense (a factor of momentum) in 
which it is used, being then also a measure of a body’s total 
energy. 

As a direct deduction from equations (8), (9), or (10), it 
appears that uniform acceleration does not imply a uniform 
field of force, another break with the familiar views of the 
older mechanics. If, for instance, we consider the compara- 
tively simple case of a body moving in one direction under a 
constant force, we put fF, = F,=0, F, = constant and », 
=), 0.. then; by equations (9), 

Pipa Ode. 
Hence the acceleration in the frame S is not constant, but 


varies as (I — v?)!. Referring to equation (10), of the second 
chapter, we see that this is a case of uniform “ vest-acceleration,”’ 
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In short, the motion could not be one of uniform acceleration, 
for as the speed increases the mass increases without limit as 
the velocity of light is approached, necessitating a proportional 
and likewise unlimited increase in the force to maintain a 
constant acceleration. Another way of arriving at this result 
is to consider at any instant a frame S’ in which the body is 
momentarily at rest, the “ rest-frame.’’ Then 


UU Rep Oe ee 
So BE,’ = afBF, — up(v. F)} 
= ab(f, — uv,F;), 
i.e., NS: = B?(z aa UA) a Fy. 


Hence the force in the frame which is at any instant the 
rest-frame is always the same as that in the specially chosen 
frame S, and so is constant likewise. Since 


Ee — [ay 


where a,’ is the rest acceleration, it follows that a,’ is constant. 

But even this element of simplicity disappears when we 
have curvilinear motion under a constant force. In classical 
mechanics we have parabolic motion obtained by compounding 
uniformly accelerated motion in one direction with uniform 
motion in a perpendicular direction, and it would not be 
unnatural for anyone to jump to the conclusion that in the 
new mechanics a trajectory under a constant force would be 
a hyperbola, obtained by combining the rectilinear motion 
having a uniform rest acceleration, and given by the equation 


x? — 2 = (1/g)? 


(see equation (12), Chapter II.), with the rectilinear uniform 
motion, 


Vi" Rh 


The fallacy in this procedure, however, will be evident on 
considering the increase in mass; for since the mass increases 
with the speed, it will require a y-component of force to main- 
tain a constant y-component of velocity ; and as the speed at 
a point of a hyperbolic trajectory is greater than the speed at 
the corresponding point of the rectilinear path with uniform 
rest-acceleration, it will also require an increasing x-component 
of force to maintain an x-component of acceleration, varying as 
(1 — v,*)?, so as to keep the x co-ordinate still connected with 
the time by the equation «? — #2 = constant. This is very 
manifest on taking equations (9) for two dimensions, with a 
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zero a, and constant vy, viz., 


Fy = Bult — vy2)a, 
Fy = Bpv,v,a,. 

If we wish a, to vary as (I — v,?)?, then Fy, must vary as 
(I — vy)? (1 — v,?)? /(x — v?)?, which increases without limit as 
v,* approaches I — v,”, or as v? approaches unity ; and, further, 
F,/F, varies as vy, and so also increases without limit. Con- 
sequently hyperbolic motion of this type cannot be produced 
by a constant force. 

Leaving these rather special considerations, we find ourselves 
still faced with the question as to the relativity of the forces in 
different frames, i.e., the validity of the equations (6). Our 
minds turn naturally to forces in an electromagnetic field and 
forces in a gravitational field. As regards the former, the 
relativity of the equations of the electromagnetic field and of 
the forces, as derived in the usual way from the electric and 
magnetic vectors, will be demonstrated in the next chapter. 
Gravitation, however, cannot be dealt with satisfactorily 
within the limits of restricted Relativity. 


CHAPTER IV. 


THE modern treatment of electromagnetic phenomena is based 
on the equations first propounded by Maxwell, connecting the 
so-called electric and magnetic vectors, or intensities. These 
vectors are not quantities capable of direct measurement, but 
are only derivable by definite mathematical manipulation of 
symbols representing other and more readily perceptible 
quantities. Thus, ifin an electrostatic field of force, we measure 
the mechanical force on a charged body, we must divide this 
by the magnitude of the electrical charge on the body to obtain 
the ‘‘ intensity ”’ of the force, i.e., if the body be at rest in the 
field. If the body be in motion through the field, this simple 
arithmetical operation will be insufficient for the purpose, and 
further steps are required whose nature it is ourimmediate object 
to elucidate. However, the assumption is implicit in all electro- 
magnetic theory that at any definite point within the region of 
influence of our appliances (charged bodies, currents, magnets), 
etc., there is a definite intensity of electric force which we 
shall denote by the electric vector E, and a definite intensity 
of magnetic force, denoted by the magnetic vector H. Ina 
purely electrostatic or magnetostatic field one or other 
vanishes. Furthermore, the modern electron theory and the 
modern theory of the structure of atoms and atomic nuclei 
leads directly to the view that, apart from the special and 
artificial appliances of man, the propagation of all forms 
of radiant energy involves a variation with time (in general, 
very rapid) of the values of KB and H at any given point 
in the space occupied at the moment by the radiation. If 
a charged particle carrying a charge e be situated at the point 
and at rest there, the force on it is eK dynes in the direction 
of KE, but if the particle be in motion with a velocity v, there is 
in addition to eK a force equal to e[v . H], and in the direction 
of [v . H], where [v . H] is the very convenient symbol for the 
so-called “ vector product ’”’ of v and H, that is, a vector quan- 
tity whose magnitude is the product of the magnitudes of v, H 
and the sine of the angle between their directions, and which is 
itself directed at right angles to both v and H in that sense 
38 
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which is related by the usual right-handed screw rule with a 
rotation from the direction of v to the direction of H. (The 
sense is indicated by writing V first inside the square brackets.) 
This formula is a generalisation from the well-known expression 
for a force exerted on an element of a conductor through which 
a current is passing, when situated in a magnetic field, and is 
derived by considering the current as equivalent to a stream 
of electrified particles (electrons). In general, the force on a 
charged body in an electromagnetic field is given by the ex- 


pression 
e( E+ [v . H)), ; : > (3) 


the addition, of course, being vectorial. 
The Cartesian components of this force are 
e(E, i VyH, a v,Hy) 
e(E, + v,H, — v,H,) 
ek, + v,Hy — v,H,)! 


“e 


(1A) 


This is frequently referred to as the “‘ ponderomotive force on 
the charge e,’’ a term introduced by Lorentz. 

In Lorentz’s analysis there is a privileged frame, “ fixed in 
the ether,” and it is with respect to that frame that the velocity 
v of the charged body is estimated. Furthermore, since all 
magnetic fields are now regarded as produced by currents of 
electricity (“ molecular,” in the case of magnetic bodies), and 
all currents, conduction or convection, are regarded as streams 
of electrons and ions, the magnetic field is regarded by Lorentz 
as due to the motion of these ions and electrons through the 
ether. From the Relativity point of view, on the other hand, 
the velocity is simply the velocity of the charged body relative 
to the observer and his measuring appliances ; and, moreover, 
the magnetic field is the direct result of, and is quantitatively 
related to, the velocity of the electrons, etc., relative to the 
observer. So that if a charged body were at rest relative to 
the observer, there would only exist for Aim an electric field 
(E) ; if it is moving relatively to him, or he to it (the mode of 
expression is really immaterial), there would exist both an 
electric and a magnetic field (KH and H). This must be borne 
in mind in the subsequent development. 

Now the vectors E and H are functions of the co-ordinates 
and time, and are dependent on one another, no matter how 
produced. Supposing, for example, we have a system of elec- 
trified bodies at rest producing only an electric field, which 
varies from point to point but not from instant to instant. 
_Now we disturb the arrangement in some way, move the bodies 
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about or alter the charges. After the system has settled down 
in a new static arrangement, we have a new electrostatic field 
with no magnetic field, but in the interim the alteration of this 
field has taken place at a finite rate, and not instantaneously. It 
requires time for the new values to be propagated to a distance ; 
in this interval there exists a magnetic field which does not 
spring into existence in any given place at the absolute inception 
of the rearrangement, nor vanish to zero at the very moment 
at which the rearrangement is complete, time being required to 
propagate changed values of H across a finite gap. In fact, as 
is well known, the rate of propagation of “‘ phases”’ of HE and H 
is just the velocity of light. No matter what may be the cause 
to which we ascribe the changing of the fields, Maxwell showed, 
on the basis of Ampére’s and Faraday’s laws for linked electric 
and magnetic circuits, and with the introduction of his well- 
known hypothesis of the “ displacement current,” that the 
spatial and temporal rates of variation of the components of 
E and H are connected by the following equations : 


dE,/dt = dH,/dy — dH, /dz 
dE, /dt = IH ,/dz —7H,/d% 
dE, /dt = IHy/dx —.dHy/dy 
— 2H,/dt = rE,/d¢y — dE, /dz 
— dffy/d% = dE,/dz — dE,/dx 
— 0H,/dt = dEy/d« — dE, /dy 
dE y/dx+ IE,/y + IE,/dz = 0 
dH ,/dx + dH,/dy + d0H,/dz = 0, 


provided these refer to points in a region not occupied by 
electric charge. As regards this form of the equations, it is 
necessary to make two remarks before proceeding. 

(1) The velocity of propagation (im vacuo) of light and all 
electromagnetic disturbances is, as before, assumed to be unity. 
This proviso enables us to drop the factor 1/c, which usually 
appears multiplying the left-hand members of the first six 
equations. Indeed, as stated previously to turn from formule 
involving the Relativity unit of time to formule involving 
seconds, we replace ¢ by ct. 

(2) The awkward multiplier 47, which frequently occurs in 
other modes of writing these equations, is removed by choosing, 
as suggested by Heaviside and Lorentz, units of charge and 
pole-strength, respectively equal to 1/27 of the customary 
electrostatic unit of charge and unit magnetic pole. In so 
doing we must, if we are to retain the dyne and erg as our units 
of force and energy, measure the electromagnetic vectors or 
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intensities in terms of units, which are in their turn respectively 
2nt times greater than the customary electrostatic unit of 
intensity and magnetic unit of intensity. 

If we wish to apply Maxwell’s method to points within 
electrified matter, and can assume that the charged particles, 
ions and electrons, are not infinitesimal in size, but are finite, 
so that around no point is there more than a finite density of 
charge, we can employ a generalisation of Maxwell’s equations 
due to Lorentz. These are generally referred to as ‘“ micro- 
scopic,’ for they purport to hold down to any degree of fineness 
in subdivision of matter, and, in particular, they take all con- 
duction or convection forms of current to be manifestations of 
simple movement of charged matter. So that if we could 
identify an element of matter having a charge density of p units of 
charge per c.c., moving with a velocity v, we should say that at a 
point surrounded by this element there is a “ current density,” 
pv, with components pv z, pvy, pu; Space does not permit of 
any adequate discussion of this conception, but any of the 
standard works on the electronic theory of matter will supply 
the necessary critical investigation of the assumptions which 
lie at the back of it. The equations as modified by Lorentz 
are then : 


dE ,/d¢ + pv, = 0H,/dy — dHy/dz. (Zz) 

and two similar equations ; 
— 0H, /d¢ = dE,/dy — dE,/dz ; oe (-)) 

and two similar equations ; 
dE ,/dox% + dE,/rxy + dE,/dz =p . . (4) 
dH,/dx% + dH,/dy + dH,/dz = 0, : A 
which, of course, reduce to the previous set at points where 


== 0, 
For those familiar with the vector notation, the following 
way of writing these equations has the great merit of con- 
ciseness : 


dB/dt + pv = curl H ee) 
— dH/et=culE . ; rn G70) 
div. E = p : . (4A) 

div. H=0,.. : : . (5A) 


When we proceed to interpret these we are at once faced 
with the now familiar question: To what frame of reference 
is the velocity v referred? To Lorentz the frame was the 
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stagnant ether, and he devoted himself with great ingenuity 
to the problem of finding the form which these equations would 
take if referred to other frames, moving through the ether, in 
the endeavour to discover if the equations retained the same 
form or one approximating closely to it, when referred to one 
of these frames. To Einstein, endeavouring to establish the 
Relativity theory on as wide a basis as possible, the problem 
was as follows. These are the forms of the equations for any 
frame ; (x, y, z,t) are space-time co-ordinates of a point-instant 
in that frame ; V is the velocity of a charged element relative 
to that frame. Adopting the transformation equations of the 
space-time co-ordinates from frame to frame, as already laid 
down, what are the relations between the electromagnetic 
vectors as measured by the observers in one frame and by the 
observers in the other, and having found these relations, are 
they consistent with any known facts concerning electro- 
magnetic phenomena? Further, will these relations between 
the vectors yield the relations between the mechanical, “‘ pon- 
deromotive,” forces on a charged body, which we discovered in 
the last chapter to be the necessary conditions for the Relativity 
of the extended equations of motion there developed ? 

In order to obtain the answers to these queries we must 
develop a few mathematical relations which are necessary for 
our purpose. 

Suppose that 4(x, y, z, t) represents a function of the space- 
time co-ordinates of a point-instant in S. Now for x, y, z, ¢ 
substitute their values in terms of x’, y’, 2’, ¢’, the space-time 
co-ordinates of the same point-instant in S’, viz., a(x’ — u’t’), 
y’, 2, a(t’ — u’x’). A function of x’, y’, 2’, t’ is obtained,which 
we represent by v(x’, y’, 2’, t’). Let us see how the partial 
differential coefficients of 4(x, y, z, 4), with respect to x, y, 2, 
and ¢, are related to those of r(x’, y’, 2’, t’) with respect to x’, y’, 
Bead. 

Thus 


dp/d% = p/dx" . dx"/X + IW/dy’ . dy'/dX + Ib/dz2’ . d2"/d% 
+ dap /de’ . de//dx 
= dp/dx' . {a(x — ul)}/dx + d/de’ . fat — ux)}/dx 
= a(dy/dx’ — udw/d!’). 


Similarly, 
op/dXy = dInp/dy’ 5 d/dz = Inp/dz2’ 
and ob/dé = a(dp/dt’ — udy/dx') : .- (6) 


Now £,, Ey, E;, H,, Hy, and H, are each functions of 
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x, ¥, 2, ¢ in general, and by the transformation equations can 
also be expressed as functions of x’, y’, 2’, t’.. So making use of 
equations (6), we see that the equations (2) can be written: 


a(dF,/ dé’ — udE,/dx') + pry = dH,/dy’ — IHy/d2" 
a(dhy/dt’ — udEy/dx’) + pry = Hz/d2z’ — a(dH,/dx' — udH,/dt’) 
a(d£,/d¢’ — udk,/dx') + pu, = a(dH,/dx’ — udH,/d’) — dH,/ dy’ 


The second and third of these happen to be more easily 
manipulated than the first, so we shall deal with them prior to 
the first. The second, for instance, requires little rearrange- 
ment to give 


d{a(E, — uH,)}/dt’ + puvy = dH,/d2’ — Ma(H, — uEy)}/dx’, 

and the third yields 

df{a(E, + uH,)}/dt’ + pu, = fa(Hy + uE,)}/dx' — H,/dy’. 
So if we put 


iy = all, —vi,) 
BE, =al—, + 4H,) 
Hi,’ = H, 
Hy = aly 4 ve; 
H, =a(i, — uly) 
p'dy' a Rey) 
ip U5 — pl, 


we could write these two in the form 


dE,’ /dt’ + p'vy’ = dH,’ /dz' — 0H,’ /dx' 
dE,’ /dt! + p’v;' = dH,'/dx' — dHz//dy’. 
These relations also give us the hint as how to proceed with 
the first equation above, for we find that 


Ya(H, — uE,)}/oy’ — d{a(Hy + uEz)}/d2" 


| ll 


I tl Il 


a(dH,/dy’ — IHy/dz’) — au(dE,/dy’ + dE,/2z’) 

a(dH,/xy — dIHy/dz) — au(dEy/dy + dE,/dz) 

a(dE,/dt + py) — au(p — dE ,/d%) by (2) and (4) 
a?(dE,/ dt’ —udE,/dx') +ap(v,—u) +a*u(dE,/0%’ —udE ,/ dt’) by (6) 
a%(1 — #?)dE,/dt’ + ap(v, — 4) 
dE ,/dt’ + ap(vz — %). 


Hence, if in addition to the above symbols, we further 


introduce 
Ey = Ex 
p'Uz' = a(pvy — up), 
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we see that the first equation becomes 
dE,//dt! + p'v,/ = dH, ry’ — dIHy'/dz". 


It is now an easy matter to show that with the same defini- 
tions of the accented symbols we can obtain the following from 
equations (3) : 

ee dH y'/ de’ = dE, /dy’ a- Ey’ /dz" 


and two similar equations. 
Further, we obtain from (4) and the definitions of the 
accented in terms of the unaccented symbols : 


DEq!/dx! + dEy! dy" + dE, [dz! 
= a(dEy'/dx — w'dEy'/dt + dEy'/y + IE, /d2 
= adk,/dx+audk,/dt+ d{a(Ey—uH,)}/dy+ dfa(E,+uHy)}/dz 
= a(dFy/d*x+dEy/dIy+dE,/d2) —au(dH,/s9—dH,/dz—dE,/d) 
= a(p — uprx), 


which we may write as p’, adding another and final member to 
our accented symbols. 
It is also easy to show that equation (5) leads directly to 


dH’ /dx’ + dIHy'/dy! + dH,//dz’ = 0. 


Having gone through the gamut, we now can say that the 
differential equations with respect to x’, y’, 2’, t’, which relate 
the quantities Ey’, 2,’ By, fi, Hy Be, pvcsp Oy. ip oy ps 
one with another, have precisely the same form as those with 
respect to x, y, 2, ¢, which relate E,, Ey, E,, H,, Hy, Hz, pox, 
pUy, pz, p With each other if 


Ey = Ey, E,' =a(E, —uH,), E/ =a(E,+uHy) (7) 
H,! = H,, H,' = a(Hy + uE,), H;) =a(H,—uEy) (8) 


px’ = a(pvz = up) 
ate ce ate ed c ‘ . (9) 
p = a(p — upv,) 


These equations supply the answer to the inquiry : ‘‘ Are the 
equations of the electromagnetic field invariant ?’’ They are 
if the measures of the various electromagnetic quantities in S 
are related to the measures in S’ by equations (7), (8), (9). 
The notion that electric or magnetic intensities at a given 
point would be measured differently by different observers was 
not novel when Einstein first demonstrated the result given 
above. For example, it would have been naturally urged that 
a charged body at rest in the ether experiences a force eH when 
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at rest in the ether, whether a magnetic field exists or not, but 
if it moves through the ether with a velocity u, the force on it 
would be the vectorial sum of eE and e[u.. H] if there were a 
magnetic field ; and an observer moving with the body would, 
on dividing the force on it by the electric charge, find the 
electric intensity (as he would call it) to be E+[u.H]; 
i.e. (with our usual choice of axes), an intensity with com- 
ponents, 
E,, Ey —uH,, E,+ uHy. 


Similarly, by considering a magnet pole of given strength 
at rest in the ether and then in motion, we should arrive at the 
result that an observer in motion through the ether would 
obtain as the magnetic intensity H — [u.. 5], or with our axes, 


H,, Hy + uE,, H, — uEy. 


These would be the results derived from older views untinged 
by any thought of Relativity, and they, at all events, show 
the dependence of the electromagnetic vectors on motion. So 
strong, however, was the impulse towards making the field 
equations as alike as possible for all observers, no matter what 
their motion through the ether might be considered to be, that 
these values for the moving observer were not regarded with 
any great favour ; for if these are combined with the ordinary 
Newtonian space-time transformations, 

x = % — Ub 
a a 
bi, 
the transformed equations of the field lose all the simplicity 
and symmetry of the original set as written for a frame at rest 
in the ether. Lorentz, employing his own transformation for 
space-time co-ordinates, arrived at a set of relations between 
the accented and unaccented symbols, which almost gave the 
same form to the field equations for the observer moving through 
the ether as for him who is at rest in it. It was Einstein who 
pointed out that if complete invariance be required equations 
(7), (8), (9) will suffice. But if they are examined it appears 
that there exists the same reciprocity as we discovered in the 
case of the co-ordinate transformation. Thus we find that 
E,=E,, LE, =a(E, —wH,); E,=a£,; + wH,’) 
H,=H,, #, =H,’ + wk); H,=(H, — wE,’) 
pix a a(p’v," eat u'p’) ; 
Py = p'Vy' 5 plz = p'Us 
p= a(p’ et u'p'V,'). 


ee 
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In other words, these relations supply no criterion as to 
whether the S or S’ frame is the one at rest in the ether, or for 
that matter, that either of them is so “ fixed.’ Once more, 
motion through the ether has become irrelevant ; it serves 
no useful purpose if complete invariance of the equations be 
demanded—at all events, so far as uniform motion is con- 
cerned. 

It will be observed that the Einstein relations differ from 
those developed on earlier views by the inclusion of the factor a, 
which, of course, exceeds unity by excessively small amounts 
for usual velocities of relative motion between matter and 
matter. 

Equations (9) lead to a very important result. It will be 
observed that these constitute an ordinary Lorentz transforma- 
tion, and show that pv z, pvy, pv, p are cogredient with the 
space-time co-ordinates. But we are already aware that 
Bvz, Bvy, Bv, B are cogredient (where B = (x — v*)—#). It 
follows at once that 


p/B = p'/B’, 


or the quantity p/8 is invariant for a given body, and will, 
therefore, have as its invariant value the density of the charge 
on the body as estimated by an observer to whom it is at rest ; 
for in his frame v = 0, and therefore B = I. 

Now, if 7 be the volume of the body in the frame §S, and 7’ 
its volume in S’, we know from Chapter I. that Br = f’7’, for 
each of these is the volume of the body in its momentarily 
proper frame, i.e., as measured by the observer to whom it is 
at rest. Hence 


pT = pT. 


So the quantity of charge on the body is the same to all 
observers. This result is of the greatest importance in the 
Relativity theory It is referred to as the invariance of electric 
charge on a body. All observers measure the charge to be 
the same. For those to whom the body is in motion, the 
diminished volume of the body is just compensated by the 
increased density of its charge. 

This result must not be confused with the well-known result 
known as the conservation of electricity or the continuity 
equation. This latter refers to any one frame individually, 
and is derived from the equations of the field. Thus from (4) 
we obtain 
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dIp/dé = I(VIE,/dx)/dé 
DIF ,/d¢)/dx 
— Ld(pvz)/dx 
i.e., dp/dt + Svzdp/dx + pLdv,z/Ix = o. 


oe | 


Those familiar with hydrodynamical equations will recognise 
that this equation accounts for any change in electric charge 
within a given surface by a flow of electricity across the surface, 
no creation or destruction being required. But this result has 
no direct bearing on the measures of charge made in different 
frames. It is true in any frame; but it could conceivably be 
true, even if S observers and S’ observers made different 
measures of charge. There are an infinite number of relations 
between charge measures in the two frames, other than that of 
equality, which would be consistent with conservation in each 
frame. 

In passing, it is worth while noting two results, which can 
be easily verified by equations (7) and (8), viz., that 


Hi 2 =f Fe 
and (Eo) = (E114, 


or the difference of the squares of the intensities of the magnetic 
and electric fields and their geometric product are invariant for 
any Lorentz transformation. The second one yields as a 
special result that if H and H are perpendicular to one another 
in S, then likewise EH’ and H’ are perpendicular in S’. 

We shall now turn to the question of the relativity of the 
“ponderomotive’”’ forces on a charged body, as embodied in 
the equations (2) and (6) of the last chapter. 

In the frame S the force on a body (idealised as a particle) 
with a charge e is e(E, + v,H, — v,H,) and two similar com- 
ponents. In the frame S’ it is e(E,’ + v,'H,' — 2,'H,’), etc. 
We need not accent the symbol e, on account of the invariance 
of the charge just referred to. We have equations (7) and (8) 
as the relations between E, H, E’, H’, and the now familiar 
equations connecting V and V’. 

Hence, if we write F for e(E + [vy . H)), 


pie seo (e y+, Hy —0, Hy) 
= efa(B — uBv,)E, + Bryo(H, — uy) — Bo,a(Hy + uk,)} 
== ea{B(E, + vyH, — v,Hy) — uBlog, + Uyly —- Uzli2)} 
Now it is easy to show that 
(Op LOS DD a sayy Dee 
7 
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Hence 
BF,’ = afBF, — uB(v . F)}. 
Further, 
p'F,’ = ef'(E,' + 0, Hy’ — v',H,’) 
= = ¢a(6 — upv,z)a (Ey — uH,) + Bo,» — 
a(Bes — up)a(H, — uEy)} 
= ea®B(Ey — wWEy + uv,H, — v,H,) + eBvH, 
= BAe + 9,H,—v,H, 
= Bly. 
Similarly, 
6'F,’ = BF. 


It now follows, even without any further use of (7) and (8), 
that 
B’(v’ . EF’) = af{B(v . F) — uBF,}. 


Thus the relativity of the ponderomotive force on a given 
' charge (or per unit charge) is proved. 

It is interesting to note that although the ponderomotive 
force components per unit charge and its activity do not con- 
stitute a cogredient tetrad (that property being possessed 
by B times these quantities), the components and activity per 
unit volume of the charged body are cogredient. This is ex- 
tremely easy to demonstrate. For the components and the 
activity per unit volume are p(£, + v,H, — v,H,), two similar 
components and p(v . B), ie., pF,/e, pFy/e, pF./e, p(v . F)/e. 
But e is invariant. Hence the cogrediency is proved if pF, 
ply, pF: p(v .F), constitute a cogredient tetrad; and this 
is so since BF,, BF,, BF;, B(v . F) are such a tetrad, and p/B 
is invariant. 

In electromagnetic theory it is well known that there exist 
two functions, one a scalar function 4(%, y, z, #), the other a 
vector function A(x, y, z, ¢), such that 


Ey = — 06/d% — dA,/d¢ 
and two similar equations ; and (Io) 
and two similar equations or, in vector notation, 

E = — grad ¢ — dA/dt 

H = curl A (Loa) 


For since 
papa Gabe aaa Oe 
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we see that H,, H,, H; can be written as in the second group 
of equations (10). 
Also, since 
d£,/dy — Ey /dz = — 0H,/dt, 
it follows that 


AE; + dA,/d#)/ry = (Ey + d4,/2d¢)/dz, 
and two similar equations can be obtained, so that 
E, + dA,/dt, Ey + d4,/ot, E, + A,/% 


are the partial differential coefficients with respect to x, y, z of 
some scalar function. If we call this function — d(x, y, z, t), we 
have the first group of (10). 

By putting these values for B and H in the equations (2) 
and (3), and making use of (4) and (5), we can show that 
A,, Ay, A;, and ¢ are solutions of the differential equations 


07A ,/d¢2 — 97A,/dx? — 97A,/dy? — 2A,/d22 = puz| 
and two similar equations in 4, and 4,, and (II) 
d2h/d42 — d7f/dx® — d2h/ay? — 9°h/dz? = p | 


or, in vector rotation, 
2A /d22 — AA = pv) 
d*p/oe? — Ad =p J 


The function ¢ is usually called the scalar potential, and A 
the vector potential of the field, and the object in introducing 
them here is to demonstrate that they constitute another of 
those cogredient tetrads of which we have now accumulated 
a fair number. If A,’, A,’, A,’, 6’ are the values of these 
potentials in the frame S’ at a given point-instant for which 
A,, Ay, Az, ¢ are the values in S, it is not difficult to show that 


(IIA) 


A,’ =a(dy— 1h) | 
Dea Ay =A ee 
Se uA | 


In fact, equations (11) and the known cogrediency of pv,, 
ply, pUz, p suggest equations (12) ; but reference to equations 
(7) and (8) justify the cogredient property of the potentials. 
For 


E,/ = — dd'/ax’ — 9A,"/d0’ ‘ 
= — aldd’/dx — u’dd'/dt) — a(dA,'/d% — u'dIA,z'/dx) 
= — da(d’ — wA,')/dx — dIa(A,z’ — u'd’)/dE. 
Also d6/dx% — IA,/di = E, = E,’. 


fae! 
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This is consistent with putting 
$= alg’ — wz’) 
A, =a(A,’—w’¢’). 
Further, 
Ey’ = — o¢'/dy’ — dAy’/d?’ 
= -—- d'/oy — al(dAy’/dt — w’dA,y’/dx), 
and 
Hy == 0A 7/0" — 0A, oy 
= a(dA,'/ox — u’dA,y/dt) — dA,'/dy. 
= a(E,’ — u’'H,’) 
= — add’ /y — a%0A,'/dt + a*u'dA,’/d% 
— a2u'dAy'/dx% + a®u’*A,’/dt + att’dA,z'/dy 
= — da(d’ — w’A,’) — IAy’/dt./dy 


But E, = — 0¢/y — 04,/dE, 
and this is again consistent with 


$= ap’ — wA,’), 


So iy 


and, in addition, 
Ag cae 


Proceeding in this way, we can show similarly that all the 
equations (7) and (8) are consistent with (12). 
A case of some importance deserves special mention here. 
If a system of electrified bodies be at rest in a frame S’, 
there is a pure electrostatic field in that frame; ¢’ does not 
depend on time, and as there is no magnetic field A’ is zero. 
In fact, ¢’ is the ordinary electrostatic potential of the field. 
In the frame S the system is in motion with a velocity w parallel 
to OX. By using the equations reciprocal to (7) and (8), and 
remembering that H,’ = H,’= H, = 0, we obtain) for the 
field in S: 
= Ey) Esa ah ee ae 
H, = 0, Hy = — auE,’, H; = auE,’. 


It can be shown that the electric intensity in S is not derivable 
from a potential function by spatial differentiation. It is, of 
course, derivable from a vector and a scalar potential in the 
usual way, and we have by equations reciprocal to (12) : 


A, = — au’d' = aud’ 
Avy =O 
$ = af". 
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Hence 


Ey = — 04/d« — IA,/d¢ 

= — addf’/dx — audd’/dt 
Ey = — add’/dy 
E, = — ad¢d’/dz, 


which proves the statement made above ; for 3¢4’/d¢ is not zero. 
(¢’ does not depend on ?#’ of course, but it does on x’, which is 
equal to a(x — wt).) 

However, although E cannot be derived from a potential 
function, it is not difficult to see that the ponderomotive force 
on @ given charge can be so derived. For its components per 
unit charge are 

E,, Ey, — uH,, E,-+ ui, 
which are respectively equal to 
E,’, ak,'(i — u*), aE,’(1 — wu?) 


or Be, 16, Ey fa. 
Now Ey’ = — d¢'/d%’, 
and dp’ /dx% = a(df’/dx’ — ud9'/d?’) 
= add’/dx’. 

Hence fp Op (Oe 

E,!|a = — a-o¢" Jay! 

= — a1d"/dy 

and E,'/a = — a—1d¢'/dz. 


Thus the ponderomotive force per unit charge is derivable 
from the potential function ¢’/a, or ¢/a* where ¢ is the scalar 
potential of the fieldinS. Thus we see that a force due entirely 
to a centre of attraction or repulsion, varying as the inverse 
square of the distance does not comply with the Relativity test 
for forces. This emerges from the demonstration that the 
ponderomotive force on an electrified body (calculated in terms 
of E and H) complies with the test, whereas a force calculated 
with E alone does not. It follows at once that we cannot 
bring Newton’s law of gravitation unmodified under the prin- 
ciple of Relativity ; for there is no obliging vector, analogous to 
the magnetic vector in electromagnetic theory, to assist the 
gravitational vector to achieve that desired end. 


APPENDIX ON ELECTROMAGNETIC PHENOMENA IN A MOVING 
MATERIAL MEDIUM. 


The equations of the field which were written down in the 
earlier part of this chapter (not containing the symbol p) were 
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originally developed by Maxwell for propagation 7m vacuo. 
The equations (2), (3), (4), (5) were used by Lorentz, and applied 
by him to his attempts to explain electromagnetic phenomena 
in material media in terms of the hypothesis of electrons 
(i.e., electrons in the wide sense of minute electrified particles 
of which atoms of matter are constituted in whole or in part). 
In these equations the field at any definite point would be sub- 
ject to extremely rapid and fortuitous oscillations, consequent 
on the fortuitous vibrations of molecules, atoms, and electrons 
as pictured in the kinetic theory of matter. This would be true 
even in electrostatic and magnetostatic phenomena as ordinarily 
understood. But in discussions of electric and magnetic 
phenomena on the basis of experimental results, in which no 
consideration is paid to theories of the structure of matter, 
we generally regard matter as continuous, its density being 
obviously an average in which the irregularities due to granular 
structure are ‘‘ smoothed out.’’ Pressure and temperature are 
also familiar examples of this smoothing out process applied to 
irregularly varying molecular momenta and energy. In the 
same way electric intensity and magnetic intensity, as derived 
from actual measurements on a finite portion of matter, must 
be each an average of the values existing at every point within 
a finite, though small, volume for a finite, though small, time. 
In the following discussion E, H, and p represent such smoothed- 
out values for a physically small volume surrounding a point, 
and not actual values at a point, as was implied in the Lorentz 
equations of the electron theory given above. In fact, the 
equations we are going to write down are usually referred to 
as “macroscopic,” in contrast to the term “ microscopic,” 
applied to the former. Maxwell was the first to suggest such 
equations, being guided to them by his equations for vacuum. 
He introduced, as well as E and H, two symbols representing 
electric induction, or “ displacement,” and magnetic induction, 
Dand B. These will be familiar to every student of electricity 
and magnetism. These two vectors are related to the former 
two by the equations 


D=c . ; ; aE) 

B=.8- & : . (14) 
or in Cartesians, 

Di ela. Cae : ~ (Baa) 

Bei ete 2 . (EA) 


where ¢ and w are the values of the specific electric and mag- 
netic inductivities of the material medium. (The units are 
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again Heaviside-Lorentz units, introduced with the object of 
removing the inconvenient factor 47 from the most important 
equations.) It is understood that the medium is isotropic and 
at rest. To give as great generality as possible, it is assumed 
that the medium has also a finite electric conductivity o, so 
that if all convective and conductive currents across a small 
area within the medium are smoothed out, we derive a current- 
density per unit area normal to the flow denoted by j (jx, 7y, 72) 
which is related to E by 


j=cE Fetes es) 
or 4, == fy, etc. : : ses) 
The Maxwell equations are then: 

WD,/dt + 7, = H,/dy — dH, /dz . . (x6) 

and two similar equations, 
— dB,/dt = oF ,/ay — dE y/dz . : Ee) 

and two similar equations, 
3dD,/3« = p : > S38) 
20b,/0x% = 0 ,  “i(19) 
or dD/d# + j = curl H & CEOAY 
— dB/t =culE. ; ; eye. 
aie ip (18a) 
div B= 0 (19A) 


Equations (16) to (1g) are called the field equations. 
Equations (13), (14), (15) are called the constitutive equations. 

These are the equations for a medium “at rest,’’ and as 
usual the question arises: to what frame? In pre-relativity 
days the ether was the frame, and Maxwell, Hertz, Larmor, and 
Lorentz all suggested generalisations of the equations which 
would cover the case of matter in motion through the ether. 
The work of Lorentz was extremely exhaustive, and being 
based on his microscopic equations for the electron theory, it 
is not surprising to find that his macroscopic equations can be 
brought within the bounds of the Relativity principle, since it 
has just been shown that the microscopic equations pass the Re- 
lativity test. But, of course, to Lorentz the problem wasto trans- 
form from the equations (13) to (19); supposed true for matter 
at rest in the ether, to a form suitable for matter in motion 
through the ether. To the relativist, the problem is to discover 
the field equations and constitutive equations which, assumed 
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to be true in frame S, will transform to exactly the same form 
in S’ on the basis of the Lorentz transformation of the space- 
time co-ordinates, and to find the relations between the accented 
and unaccented symbols which will comply with this demand. 
It was Minkowski who first gave a complete solution to this 
problem ; complete in the sense that he gave a set of equations 
which satisfied the Relativity test, but left open to some extent 
the precise physical quantity which a given symbol was sup- 
posed to represent. But if one employs the method used in 
the chapter just concluded, it can be shown that equations (16) 
to (19) transform into an exactly similar group in accented 
symbols, provided 


EB,’ =E, 

E,' = a(E, — uB,) : » (20) 

Ey = a(E; + uB,) 

D,' = Dy 

D,’ = aD(y — uH,) ; : ea) 

De = alD,-- ui) 

Hi, == Hy 

H,’ = a(H, + uD,) (22) 

H,! = a(H, — uDy,) 

B,! = B, 

B,' = a(By+ uE,) | (23) 

BS lb, =| 

Ix’ = al jy = up) 

I Id (24) 
p = a(p — 4x) 


Suppose now that a material medium is at rest in S’, and 
therefore in motion through S with a velocity u parallel to OX. 
As it is at rest in S’, the simple constitutive equations (13), 
(14), (15) are supposed to be true, i.e., 


D’ = «E’ 
B’ — pH’ 
i: = ok’ 


for these are known to be true in the case of stationary, homo- 
geneous, and isotropic media, e, , and o being the values which 
would be found for the constants by ordinary laboratory 
experiments (in Lorentz units, of course). 

It follows that for the body in motion we have 
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D,=cE,; Dy—uH,=«(Ey—uB,) ; D,+uHy=«(E,+uBy) (25) 


By=pH; ; By+uE,=p(Hy+uD,) ; B,—uEy=p(H,—uDy) (26) 
a(Jx — Up) = Ex; Jy = oa(Ey — uB,) ; j; = oa(E; + uBy) (27) 


Let us now restrict the body to be a dielectric, so that 
o = 0 and therefore j = 0, and let it be in the form of a slab 
between two metal plates, which are connected to the ter- 
minals of an electrometer and are parallel to the plane OXY. 
Suppose a permanent magnetic field is established parallel to 
me amis Oly, so that He— 7, = 8, = B, = 0, and in conse- 
quence the slab is moving with a velocity wu at right angles to 
H. The equations show, and experiment verifies, that there 
must be an electric field and polarisation of the dielectric 
produced in the direction OZ, i.e., perpendicular to the direc- 
tions of the motion and the magnetic field. For, by (25), 


D, + uly = €(E, + uBy) 
and D2 =), = i, = Ly = 0 
By (26) By + uE, = p(y + wD,). 
Hence D,+ uy = cE, + eu(uH, + wD,) — ewE, 
or (r — enw?) D, = e(1 — w)E, + ulew — I)Ay. 

In the experimental work the apparatus is not sufficiently 
refined to take account of terms involving the square of uw; 
so neglecting these, we have 

D, = cE, + ulew — 1)Ay . : 228) 

If V is the difference of potential established between the 
plates and read on the electrometer, 

| he A | 


where d is the distance between the plates. 

Also D, is the surface density of the charge on the plates, 
and is therefore equal to KV where K is the capacity of the 
plate condenser per unit area. Hence 

(K — e/d)V = u(ep — 1)Hy. 
If the permeability of the medium is unity, we have 
(K — e/d)V = ule ~ 1)Hy. 


This equation was verified in a well-known experiment by 
H. A. Wilson in 1904 (‘‘ Phil. Trans.,” A, 204, p. 121). This 
_was before the appearance of Einstein’s first paper on Relativity, 
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and the experiment was really designed to decide between the 
above equation, which had been derived by Lorentz from his 
theory, and an equation which had been obtained previously 
by Hertz, using a different field equation for moving matter. 
Hertz’s result was 

(K — «/d)V = ueH,,. 


Wilson showed conclusively that the factor on the right-hand 
side was « — I and not e, thus disproving Hertz’s equations. 
As a matter of fact, Lorentz’s equation is not quite (28) 
above, but is 
D, = e£, + ule — 1)pH,. 


As pis practically unity for any dielectric, experiment fails to 
discriminate between this and the genuine Relativity result. 


CHAPTER V. 


WE have now become familiar with the most arresting feature 
of the analysis contained in the preceding pages, viz., the 
dependence of the time co-ordinate in one frame of reference 
on both the time and space co-ordinates in another frame. 
Another aspect of this is the intertwining of the measures of 
momentum and energy, of force and activity. An equally 
striking property of the theory is the derivation of the electric 
or magnetic vector in one frame from both the electric and 
magnetic vectors in another. Every physicist is familiar from 
the days of his first lessons in Mechanics with the fact that for 
the sake of mathematical analysis we resolve measurements 
of displacement, velocity, acceleration, force into components 
along directions which are entirely arbitrary. What is in 
reality an indivisible unity in our space, we divide up in a 
purely artificial manner for our own convenience, our justifica- 
tion being that on subjecting each portion to suitable mathe- 
matical treatment and reuniting the results, we arrive at a 
reality of our perceptual experience. Nevertheless, although 
the power of analysis by Cartesian methods is admitted beyond 
all question, the mathematical physicist has of late years been 
resorting more and more to the use of the methods of Vectorial 
Analysis, inasmuch as they keep well in view the unity which 
is apt to disappear when resolution into components takes place 
at the outset of the solution of some problem. 

Now one illustration of this underlying unity can be seen 
in the fact that when for some purpose or other we transform 
our axes, i.e., resort to a different mode of 1esolving, any one 
component in the new axes depends in general on all the com- 
ponents in the old system, and not merely on one. But this is 
just the feature we have signalised above in the analysis of Rela- 
tivity ; there it occurs not alone in connection with the space 
co-ordinates ; it exhibits itself in connection with both space 
and time co-ordinates.* Is it not possible then that in accepting 

*The fact that only one space co-ordinate appears in the most 
novel of the equations of the Lorentz transformation as used so far, 
depends on the special choice we have made for the axis of x. It is a 
feature which is of little importance and will be removed shortly. 
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the truth of the Relativity standpoint, we may find ourselves 
driven in the long run to admit also that Space and Time are 
not distinct and separate unities in themselves? May they 
not be two divisions of some underlying unity, which each of 
us separates in manner most convenient to ourselves, no one 
method of separation, however, being of greater fundamental 
importance than any other ? 

The concept of multi-dimensional space is not a novelty, 
and although we cannot visualise it, we frequently make use of 
its geometric terminology as a convenience and a means of 
economising speech and writing when dealing with the measur- 
able properties of physical systems—notably so in the subjects 
of Generalised Dynamics and Statistical Mechanics. In par- 
ticular, the existence of a fourth dimension has always appealed 
to the imagination of the scientific worker and philosopher. 
The notion that our arrangement of events in time order is a 
three-dimensional being’s mode of dealing with a static arrange- 
ment in space of four dimensions which would actually be per- 
ceived as such by a four-dimensional being, is as old as Greek 
philosophy. What we perceive at any instant is a part of a 
three-dimensional section of a four-dimensional world, the 
section being ‘“‘ normal” to the “ axis of time.’”’ Even when 
at rest in our space, any object is moving parallel to the axis 
of time. When an object moves in our three-dimensional 
space, it has a component of motion along the axis of time as 
well as along any spatial axes which we select. We may not 
go so far as to postulate the actual existence of such a four- 
dimensional space-time continuum, yet we make practical 
use of the idea every time we draw a graph representing 
displacement of a body and time. There, of course, we are 
dealing with a one-dimensional movement and time, and 
the paper or blackboard surface is two-dimensional. If prac- 
ticable, we could represent the graph of a two-dimensional 
motion and time in a three-dimensional frame, say by threads 
instead of chalk or pencil-marks ; but these are, after all, only 
material aids to our geometry. We can easily visualise the 
curves or graphs, because we can perceive three dimensions. 
When we attempt the same procedure for three-dimensional 
motion and time, our geometric intuitions fail because we do 
not perceive four dimensions. We have to resort to analogy. 
We assume that there is an axis orthogonal to any three 
mutually perpendicular axes which we choose in our space ; 
that the origin of our space axes is travelling along this fourth 
axis of time, but that there is a “ fixed” origin for our four- 
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dimensional axes, viz., the position occupied in this four- 
dimensional world by the origin of our space axes at some 
definite instant in the history of the events we are considering. 
It has become customary since Minkowski introduced the term 
to refer to this four-dimensional concept as ‘‘ the World” 
(“die Welt’’), retaining the word, space, for our ordinary 
three-dimensional perceptions. ‘‘ Space-time continuum ”’ is 
also used. In the World the history of a particle is contained 
in the curve which is the locus of all its positions in the World. 
If at rest in our space, this is a straight line in space-time parallel 
to the time axis; if in uniform motion in our space, it is a 
straight line making a constant angle with the time-axis; if in 
uniformly accelerated motion in our space, it is a parabola with 
the tangent at its vertex parallel to the time-axis, etc. 

Similarly, the history of a body is contained in a four- 
dimensional tubular space generated by the “‘ World lines,” 
““ Space-time lines,”’ of its various particles. When one experi- 
ences any difficulty in imagining this World, an appeal to the 
analogy of the three-dimensional World of a two-dimensional 
space will prove very helpful. One fact about this four- 
dimensional World we must grasp and bear in mind for our 
immediate purpose: just as in three-dimensional geometry the 
normals to a plane are all parallel to each other (if the 
geometry be Euclidean) and intersect the plane each in one 
point only, so in four-dimensional geometry, a given three- 
dimensional section has its normals parallel to each other, i.e., 
there is one and only one direction in space-time perpendicular 
to all the lines in a given space, and any such normal cuts 
this space in only one point, all other points of the normal not 
lying in this space. 

As stated, these ideas are not novelties; but their special 
importance for Relativity lies in the facts mentioned at the 
beginning of this chapter. In pre-relativity days one regarded 
the space sections of the World made by different observers as 
“ parallel’ to one another, i.e., as normal to one direction in 
the World which would be the direction of the common axis of 
time. This appears in considering the Newtonian transforma- 
tion for uniform relative velocity between the space-frames of 
two observers : 

Gy = a 
Xo! = XQ; Xs = Xz 
Hy = 4 


(where the suffixes 1, 2, 3 refer to space co-ordinates and 4 to 
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the time co-ordinates), We could make a still further trans- 
formation to space axes obtained by rotating the three space- 
axes OX,', OX,’, OX,’ about some line in space, but we should 


still retain 
X4 —— X 4: 


We can state this Newtonian procedure in a picturesque 
way by saying that all observers make the same space section 
(at a given moment) of the World, no matter what their relative 
velocity may be, or, in another way, by saying that the World 
lines of any two bodies at relative rest in space are parallel to 
one another and are in the direction of the time axis. But 
for the relativist each observer has his own axis of time in 
the World. Another observer, moving im space relative to 
the former, employs a different time-axis inclined to that of the 
former and so makes a different series of space sections of the 
World when thinking intelligently about his experiences. Of 
course, these geometrical statements are contained in the 
analytical statement of the Lorentz transformation : 


%y = a(%, — UX4) 
X_ = %_) X3' = Xe 
Le == a4, — U2). 


On consideration, it will appear that the space co-ordinates 
and the time co-ordinate in a frame S are related to those in 
S’, just as the four co-ordinates of a point in the World referred 
to axes OX,, OX,, OX3, OX,, are related to the four co-ordinates 
of the same point referred to axes OX,’, OX,, OX3, OX,’, two 
of the axes remaining unchanged, the other two being rotated 
in their common plane, and the units in which the first and 
fourth co-ordinates are measured being changed. In the figure 
the axes OX,, OX, cut the conjugate rectangular hyperbole 


427 — x2 = + a? 
in the points A,, A,. The lines OX,’ and OX,’ are a pair of 
conjugate diameters inclined to the lines OX,, OX, at an angle 
6 given by 
tan 0 == u. 


These cut the branches in A,’, A,’. 
The equations of the hyperbole referred to OX,', OX,’ 
are, as is well known, 
Hy'% — m4 ae 4g’, 


where a = OA One 
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and it is not difficult to show that 
OM'/a’ = a(OM/a — u . MP/a) 
M’P/a’ = a(MP/a — u . OM/a), 
where a = cos 6/,/cos 20 = 1/,/(1 — tan? 6) = 1/,/(1 — u?). 
Hence if we write 
%,= OM/a; x, = MP/a 
%_ = OM'/a’'; x,’ = M'P/a’, 
i.e, employ @ as unit of length and time in one frame, and a’ as 
unit in the other, we reproduce the Lorentz transformation. 


7] A 
ay M | 
CB) | i 
0 lA, ™M 
ee 
Fic. 4. 


The two axes OX,’ and OX,’ would still be perpendicular 
to OX, and OX;, but not perpendicular to each other, that 
is, from the point of view of the observers in S. Of course, 
observers in S’ would consider all their axes as mutually ortho- 
gonal, but the axes of x, and x, employed by S as not perpen- 
dicular to each other. This is a fanciful way of stating that 
units of length and time are different, and, further, that 
_ simultaneity is a relative conception.. To give ourselves a 

chance of visualising this, consider a two-dimensional space with 
particles moving in it. The World will now be a three-dimen- 
sional one, and the World lines of these particles will be definite 
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curves in it. A plane parallel to OX,X, will cut these World 
lines in points which represent simultaneous positions of the 
particles from the point of view of one observer. From the 
point of view of an observer moving relative to the first with 
a velocity « parallel to OX,, simultaneous positions of the 
particles will be represented by the intersections of the World 
lines with planes parallel to OX,’X,. 

There is, of course, no necessity to restrict ourselves to an 
alteration in only two of the axes. In the earlier chapters 
dealing with uniform relative motion we did so, and gave a 
fictitious importance to the axis of x, which it assumed because 
we chose it to be parallel to the direction of relative motion of 
the frames S and S’. But, after all, we are quite familiar with 
changes of axes in space, and for any given time axis in the 
World we can have an infinite number of sets of space axes, ViZ., 
any triad of lines in our space, or, rather, the positions of these 
lines in the World at a definite originalinstant. It is easy to re- 
move the importance of OX. Thus, reverting to ordinary nota- 
tion for a little, we choose axes in S such that the direction cosines 
with respect to them of the line OO’ (i.e., the line of relative 
motion of S’ to S), and two lines at right angles to it and to 
each other are (21, 71, 23), (Le, Mo, M2), (ls, M3, M3). The axes in 
S’ are such that these same three lines have direction cosines 
(Ly, my’, 11’), (Lo’, M2), 22’), (1s’, Ms’, M3’). Then the first three 
of the Lorentz equations become 


Lyx’ + my'y’ + ny'2’ = a(lyx + my + nyz — ut) 
Lex’ + mo'y’ + n9'2’ = 1% + mey + nee 
s'x’ + ms'y’ + N32 = 13% + may +- N32. 


Solving for x’, y’, 2’, we obtain 
4! = (aby 1, + 19'Le + 1,'lg)% + (aby'my + Le’m, + 15'ms)y 
+ (aly'n, + 1,'n, + 1,'n3)z — aul,'t, 


and two similar equations. 
If we chose axes so that 


ie = Ly Mm,’ = M,, Ny" = Ny, (v = I, 2, 3), 
these equations simplify to: 
x’ =(a—1 - 4y?+1)*4+(a — Ill ymy+(a — 1)l nz — aut 
y= (aI), x+ (a—1 . my2+1) y+(a—1)m,n,2—aU,t (A) 
2'==(a—I)nylx+(a—I) . nymyyt(a—1 . ny2+1)z—au, 


where ty, Uy, 4; are components of u along the axes. 
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In connection with this latter choice of axes, it must be 
noted that they do not coincide at = 0 = ?’, e.g., if t= 0, 
the point (I, 0, 0) in S, which lies on OX, does not lie on OX’, 
for its co-ordinates in S’ are (a —1)J2+1, (a — 1)ml,, 
(a —1)n,/,. This, in fact, illustrates an earlier remark that a 
pair of lines which are at right angles from the point of view of 
S are not in general at right angles from the point of view of S’, 
and so could not be simultaneously (to S’ observers) coincident 
with a pair of lines which are perpendicular to each other from 
the point of view of S’. 

The fourth of the Lorentz equations is easily obtained for 
general axes. It is 


Y= alt — u(lyx + my + nyz)} 
or t! = — aux — atyy — au,z + at ; < teB) 


In vectorial notation equations (A) and (B) can be written 
as follows, where r and r’ represent the radii vectors OP and 
Ore: 

r’ =r—datu+(a—i1)(u.rju/w . pane) 
b= afi— (aT). 2 i(D) 


In (C), (a. 1r)u/w? represents a vector in the direction of u 
having the same magnitude as the resolved part of r in the 
direction of u. 

The reciprocal equations are : 


r=r4+a/u+ (a--1)(u.r’)u/uv? . ate) 
ta’ +(u-r’)} . : ; 


To return once more to a consideration of the space-time 
World, we see that the more general Lorentz equations express 
a general rotation of all the axes, and are not like the simpler 
form of the equations, concerned with a rotation “ in the plane ”’ 
of the time axis and one space axis. The essential feature of 
any of the forms, simple or general, as contrasted with any 
forms of the Newtonian transformation equations, is the impli- 
cation that the axis of time is in different directions in the 
World for observers in relative motion to each other in space, 
i.e., that their space sections are different, which is our geo- 
metrical way of saying that a group of simultaneous events for 
one observer cannot in general contain all the events in a group 
of events simultaneous to the other observer. 

There is one feature about the rotation which cannot escape 
notice. As far as the space axes are concerned, they experience 

a rotation as a rigid frame and remain at right angles to one 
8 
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another when we use the simple Lorentz transformation, while 
the time-axis suffers a rotation with respect to one of the 
space-axes, which, however, maintains it in a certain geometric 
relation to that space axis, viz., as a diameter of a certain 
rectangular hyperbola conjugate to the space-axis. In the 
more general transformation, none of the axes preserve the 
relation of orthogonality, but they preserve the relation of a 
conjugate tetrad of diameters to either of the four-dimensional 
hyperquadrics 
Hy" + He? + X57 — 44" = 4-T. 


For those who are unfamiliar with the concept of four 
dimensions, the appeal to analogy is again helpful, and it will 
be found that if we limit our equations to a two-dimensional 
space and a three-dimensional World, the equation 


My" Ay = Hy 
transforms into 
My he tg == ee, 


and if OX,, OX,, OX, are a set of conjugate axes for these 
hyperboloids, then so also are OX,’, OX,’, OX,’. In our four- 
dimensional World it is the mutual conjugacy of the four axes 
with regard to a definite hyperboloidal hyper-surface which is 
the invariant property of any one of the sets of axes connected 
by a general Lorentz transformation. When we say that in 
stepping from one frame to another in space, we rotate our 
four-dimensional axes, we imply that the rotation is of this 
nature, and not a rotation analogous to the rigid rotation which 
preserves orthogonality in three dimensions. The objection 
may be urged that the three space-axes have been chosen to 
be mutually orthogonal; that is so—to observers at rest 
relative to them. Each group of observers chooses orthogonal 
space axes to suit their own convenience, but such axes are not 
in general orthogonal to other observers in relative motion ; 
that feature was emphasised above. Appealing to geometry 
of three dimensions once more for an analogue, we note that 
any plane does not necessarily intersect two planes perpen- 
dicular to each other in perpendicular lines ; a certain limited 
class of planes does so, but not all planes. 

The analysis of the general Lorentz transformation is the 
analysis of a hyperboloidal hyper-surface in a four-dimensional 
space referred to any four conjugate diameters as axes. This 
ete from the fact that with any Lorentz transformation we 

ave 
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2 pte oats Pan , ie 
Ny 4- oe a x“ — a —- xy 2 -[- x * -- xe" as OF 
or 4° x7 +- x, — x! 


is invariant ; and the minus sign before the last term introduces 
the analysis suitable for hyperbolic figures. Had the plus sign 
occurred before all the terms, we should have been dealing 
with the analysis of a spherical hyper-surface. 

It is, however, possible to get rid of the minus sign and give 
the analysis a formal similarity to the analysis of orthogonal 
transformation in four dimensions. If, instead of writing x, 
for the time co-ordinate, we put x, equal to the product of 
./(— 1) and the time co-ordinate (“imaginary time ’’), we see 
that 

Hy? + X27 -+- X57 + x? 
remains invariant for a general Lorentz transformation. Thus 
the simple transformation becomes 


%, = ax, + vax, 


X— = %y; Xs = Xs = /(— 1)} 
Lg = aX, — 104%, 


so that if we introduce an imaginary angle ¢, for which 


tan d = 
sec ¢ == ./(I — u?) 
cos ¢ = a 


sin ¢ == val, 
we can write 
x’ =%x,cosd¢+x%,singd 
Xe; Xs) = Xs 
: — x,sind + %, Cos ¢, 


4 
a 


which corresponds to a rotation of the axes OX,, OX, in their 
own plane through an angle ¢, the axes remaining at right 
angles. 

The general Lorentz transformation corresponds to a 
rotation of all the axes in the World in which the space-axes 
and axis of imaginary time remain orthogonal. The most 
convenient way of writing it is: 


Hy" = Ayy% + Ayo%_ + Ay3%3 1 ne | 
Hy" == AgyX + AgoXe + @o3X3 + Aoa%q 
Ng = Agy% + AgoXo + Aga%_ > AgaXq 
Hq! = Agy%y + AgoXy + Mag%3 1 Aaa a 
where the a coefficients are constants depending on the relative 


movement of the space frames considered, and can be easily 
8 * 


(I) 
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interpreted from the earlier expressions, remembering that «4 
now is equal to 1x, in the former equations. It will be seen 
that the nine coefficients involving the digits 1, 2, 3 in their 
suffixes are real, the six involving the digit 4 once in their 
suffixes are imaginary, while a,, is real and equal to a, or 
I — u?)—4, 
mane further progress there is no need to make use of 
the values of the a coefficients in terms of w and the direction 
cosines. 
Since 
oe Da 

it follows that 

Ay + Gey + ag? + aq? =I. - (2) 


and three similar equations, and 


A134 — + AgyAo9 + 431439 + Aq1Agg =O (3) 


and five similar equations—ten equations in all. 
It considerably economises labour to introduce the summa- 
tion symbol and write (1) as 


’ 
aN eae ZAyaHas 


where the summation extends over the values I, 2, 3, 4 fora; 
putting A equal to I, 2, 3, 4 in succession, we have the four 
equations (1). Indeed, since the sufhx a appears twice in the 
term which has to be summed, we can omit the symbol = on 
the understanding that any suffix which appears twice in a 
term on either side of an equation is a ‘dummy,’ ‘i.e., the 
term has to be summed for values I, 2, 3, 4 of the repeated 
suffix. We shall for convenience employ the earlier letters of 
the Greek alphabet, such as a, f, « for dummy suffixes, and Jater 
letters such as x, A, » for suffixes which are not dummy. 
Thus we write equations (1) in the concise form 


pat = araXa . . . . (x) 
and conditions (2) and (3) in the form 


Aude, =LTifrA=p : : ~ (2) 


=oOifA+p.  . : a(3} 
Arising out of equations (2) and (3), the following results 


are true and not difficult to establish : 
Theorem I.—The determinant, 
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ary Qi2 43 Q4 
Go, ogo Aes Ue4 
| 31 a32 A33 a34 
| May ae M43 Aaa 


is equal to unity. 

Theorem II.—Any constituent of the determinant is equal 
to its co-factor.* 

Theorem IIJ.—Any second minor of the determinant is 
equal to its co-factor. 

Theorem IV.—Equations similar to (2) and (3) are true if 
the determinant is read by rows instead of columns, that is 


Rao tet A ‘ . 4) 
=0,1fA + pw : : (5) 

and, further, 
Fs as onl : , ; ee (a) 


(N.B.—a,, is not to be confused with a,, ; it is not in general 
tae that 2, = d,,.) 

From this point onwards let us accept the lead which has 
been given to us by our equations, and admit the reality of 
the space-time continuum. The objection that it cannot be 


‘ 


“ perceived ”’ is trivial. Neither can space or time be “ per- 
ceived ’’ in the sense which is implied in that use of the word 
““perception.”’ Space and time are modes of perceiving things. 
Relativity has taught us that the two modes are not independent 


* The sign of the co-factor must be carefully noted. Thus the co- 
factor of a,, is the first minor obtained by removing the first row and 
first column, but the co-factor of ag3 is minus the first minor obtained 
by removing the second row and third column, because in order to 
bring the constituent agg into the leading place we must first of all dis- 
place the second row into the first row by one step and then the third 
column into the first column by two steps, three steps in all, i.e. an 
odd number. If the steps required in any case are even in number the 
co-factor is the first minor with sign unchanged. 

Similar adaptation of the sign must be made in the case of the co- 
factors of second minors. Thus taking the second minor 


| Arn A254 
39, 34 
its co-factor is 
ay ai3 
By, 43 


for it requires one step to transfer the second column to the first and 
then two steps to transfer the fourth column to the second ; another 
step brings the second row to the first, and still another the third row 
to the second. Thus five steps in all (an odd number) brings the first 
of the second minors written above into the leading place. 
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of one another. The manner in which they depend on one 
another is special to each observer ; but for human beings, on 
a limited portion of the earth’s surface, who use nearly the same 
frame of reference in life’s ordinary affairs, the time axes are 
so nearly parallel that all their space sections are practically 
the same. Hence arises the illusion of the independence of 
space and time. The question is not one of philosophical 
interest alone. We have just seen the elegant and symmetrical 
form into which the Lorentz equations can be thrown, a form 
in which any distinctive feature of the time co-ordinate as such 
disappears. But this is no accident. Not only these initial 
equations, but all the equations of the Restricted Relativity 
theory can be exhibited in a similar elegant form which is a 
perfect analogue in four variables of the Analytical Geometry 
and Statics of the text-books. For this purpose we employ the 
branch of mathematical analysis known as ‘‘ Tensor Analysis.” 
If its sole use were to provide a pretty garment for the restricted 
theory of Relativity, the critic might well say that the mathe- 
matician had allowed to his xsthetic sense an undue importance. 
But as we proceed further we shall see that the great stride 
from the Special to the General Relativity theory, which has 
created such a revolution in our ideas about gravitation and 
also electromagnetism, could hardly have been made, certainly 
not with such powerful effect, had this branch of analysis not 
been ready to hand. Indeed, the very generalisation of it 
which is required in its application to the general relativity of 
all phenomena, has not only guided us to the formation of 
geometrical concepts concerning the four-dimensional world, but 
is actually impelling us towards researches, which may have as 
their outcome success in the measurement of such geometrical 
properties. With such ideas and possibilities before man’s 
mind, it is idle for the critic to refer to the reality of space-time 
as a pointless speculation. 

We shall, therefore, in the next two chapters, give an exposi- 
tion of Tensor Analysis as required for the special theory of 
Relativity. This will lead us naturally to the second part, where 
we shall proceed to develop it in connection with the idea of 
General Relativity, which abandons the restriction that all frames 
of reference must be in uniform relative motion to each other. 
In all this, however, we shall keep physical phenomena, as 
such, well in view. When familiarity with the mathematical 
weapon has been assured, the final chapters will put before the 
reader the thought that all phenomena are but some aspect of 
a “ World Geometry.” 


CHAPTER VI. 
VECTORS. 


Any of the usual directed quantities or vectors in Physics is 
said to be represented by a point in space, if the displacement 
from an origin O to the point has a direction parallel to the 
direction of the quantity and a length proportional to its 
magnitude ; so that the components of a vector are repre- 
sentable by the co-ordinates of a point. In consequence, a 
change from one set of rectangular axes to another set with the 
same origin involves relations between the new components of 
a vector and the old which are entirely analogous to equations (I) 
of the last chapter (the suffixes being confined to 1, 2, 3), 
provided the coefficients are connected by equations analogous 
to (2), (3), (4), (5)- 

In the same way, if we have any group of four quantities 
which on transformation are related as (%,', %9, *,', %4') are 
related to (1, %9, ¥3, ¥4) by equations (1), we refer them as com- 
ponents of a vector in four dimensions, a “ four-vector.” If 
P,, P,, P3, P, are the components of a four-vector P along the 
ee, OKO, OX, and P,, Ps, fs, , ate the.com- 
ponents of the same vector along OX,’, OX,’, OX,’, OX,’, then 


eo hs tel gt = Py id a yo | ne 
or POP, is invariant. 
This is a particular case of a general theorem (which can be 


easily proved by an appeal to equations (2)-(5)) that if P and Q 
are two four-vectors, then 


tO ae SS wpe 


in which the recurring suffix implies, as usual, a summation of 
four terms or P,Q, is invariant. 
We call this the geometric product or scalar product of the 
two vectors, and it is frequently denoted by the symbol (P. Q). 
If the scalar product is zero, we say that the vectors are 


perpendicular to one another. 
11g 
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Conversely, if any set of four quantities gives an invariant 
product when multiplied geometrically by an arbitrary four- 
vector, it must constitute a four-vector. 

As may almost be anticipated without proof, all those groups 
of four quantities which we found in the earlier chapters to be 
subject to a simple Lorentz transformation constitute four- 
vectors, or rather they do so when the fourth member of each 
group is multiplied by .. 

Thus, if (%1, %2, %3, %4) are the space- (imaginary) time co- 
ordinates of a point-instant, and x, + 6%, ...,...,%, + 6%4 
those of an adjacent point-instant, 


200 20%, *. 


This invariant we write as 5s?; ds is, in fact, an element of the 
world-line of a particle which was situated at the first point at 
the first instant, and at the second point at the second instant. 
It is clear that 5s? = — 67?, 57 being the proper time of the 
earlier analysis. Now, 6%,, etc., transform just as %, etc., do, 
and constitute the components of a four-vector. Since ds is 
invariant, it follows that 


AGES wea eee os 


are the components of a four-vector. They are evidently 
analogous to the direction cosines of an element of a curve in 
three dimensions. 

Before proceeding to further examples of four-vectors, we 
can deal with another important invariant. By a well-known 
theorem in transformations 


ON4 ON ON, OX, = J OVO —OXOv, 


where J is the Jacobian of the transformation.* But this 
Jacobian is the determinant of the @ coefficients, which, by 
Theorem I., is equal to unity. Hence 


8% 10% OX 0% 4" = 8% 9% .0%,0% 4, 


or an element of four-dimensional] volume is invariant. 
We are already familiar with this result in another guise. 


* The Jacobian of the transformation is the determinant 


om1'/d*, 0%'/0%2, 041 /d%3; 0%'/0%4 


Das Dap, Hs! [Das Dita [Dar Dg [D4 
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If vy, vy, ¥; are the components of the velocity of a particle in 
a frame S, then 
Vz, = 6x%/St = 8%,/5x,, etc., 
and therefore 
v? = Su,? = — (6x7 + 5x, + dx57)/8x,? = (8% 42 — 852) /8x,2. 


Hence 
ax,/ds = (1 — v*)—* = BP. 
Consequently, 


bx4/5x4' = B/B" 


where v’ is the velocity in a frame S’ and f’ = (1 — v’*)—3, 
Hence 
POt,O% O%,— BOX, OX, OX, , 


or an element of (three-dimensional) volume of a body as 
measured in a frame S varies as 1/8 or (I — v?)?, where v is 
the velocity of the body in the frame, a result which we deduced 


in Chapter II. 
It follows at once that 


ax,/ds = — Bo, 


dx,/ds = B 
and (multiplying by the factor x), we see that Buz, Buy, Buz, oP 
are components of a four-vector, i.e., 
B'vg' = Ay1B0x + Ay 2BVy + 1380, + A148 
and two similar equations, and 
UB! = Ay, BVy + AgoBvy + Ay3PU; + Gay. 


But, after all, these transformation equations of the “ velo- 
city four-vector ’’ can be written more succinctly thus : 


Wi \ds = Gd ds; 


in which is implied the whole of Einstein’s kinematics. 
In precisely the same fashion the transformation of accelera- 
tions is summed up in the equations + 


O4ds* = G,d-%,/ds", 


_or in the statement that 
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dix, /ds*, . .., <5 ., de ds* 


constitute a four-vector, the ‘‘ acceleration four-vector.” In 
terms of the three-dimensional velocity components of a par- 
ticle’s motion, these are 


d(Bv,)/dt, . . ., . . ., aBldr 
or Pa(Bu,\jdt, «= s,0s = =, ea plat. 
Since 
&(dx,/ds)? = 1, 
it follows that 
ax,{ds. a*x,/ds* = 0, 


which we express by saying that the acceleration four-vector 
and the velocity four-vector are at right angles to one another. 
This is really a generalised form of the equation (4) of Chap- 
per tl 

Likewise the equations of motion and of energy of the 
third chapter are summarised in the equations 


d(pdx,/ds)/ds = 


where p is an invariant and P, is a new four-vector, the “ force 
four-vector,’”’ or, more strictly, the “ force-activity’’ four- 
vector. For -Py. P, are easily seen to be the — fF,, 
cups LeU F) of that chapter, so that the equations for 
transformation of force and activity are succinctly written : 


eke = LBs ae 


Similarly, »dx,/ds (after multiplication by x) are the mv,z, . . ., 

., um of Chapter III., and so constitute the ‘‘ momentum- 
energy ”’ four-vector. 

Having rather hurriedly clothed the analysis of the earlier 
chapters in its vector symbolism, we should, to complete 
the analysis, demonstrate the four-vector property of the 
quantity P on the basis of the electro-magnetic equations. 
But in order to turn the analysis of Chapter IV. into vectorial 
form, we must extend our knowledge of vectorial mathe- 
matics somewhat further, so that we can not only achieve this 
purpose, but also use this powerful mathematical weapon to 
deal in the following chapter with the dynamics of continuous 
media and free ourselves from the narrow atmosphere of particle 
dynamics. 

If J(%, %2, ¥3, ¥4) be a scalar function which transforms into 
b(%4", Xa", Xs’, %4') by the general Lorentz transformation (1) of 
Chapter V., we see that 
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dyb/dx4' = d6/d%, . 9K,/9Xy' 


and three similar equations. 
By (6) of the last chapter, 
Oe 02, = 42,7,7etC. 
Hence 
dp /dxy' = Ay.d6/ 0%, 
and three similar equations ; or 


dy/dx,’ = AyadP/d%X aq. 
Hence 0¢/dx, constitute the components of a four-vector, 
which we call the Gradient of ¢. 


If P, is a four-vector which transforms into Pe aitvis*not 
difficult by (2) and (3) of the last chapter to prove that 


dP, |X. = IP,’ /d%,', 
or 0P,/dx, iS invariant. 


By analogy with a similar result in three-dimensional 
analysis, we call dP,/dx, the ‘‘ Divergence”’ of the vector, and 
write it Div P, using the capital letter D to distinguish it 
from a three-dimensional divergence of a “ three-vector,”’ 
which we shall write “ div.” 

ahs s/o, -- ., - .., 0/d%, are the four components of 
a four-vector operator. Operating on a scalar function we 
obtain a vector function, the Gradient. A symbolic geometric 
multiplication of this operator and a vector yields an invariant 
scalar function, the Divergence. 
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Let P and Q be two four-vectors, and let us write down in 
square array the following sixteen products : 


P,Q1, LO; P,Q, P,Q, 


PiQ, PsQ. PaQs, Pie 
If we transform to other orthogonal axes (the general 
Lorentz transformation), we can write down the sixteen corre- 
sponding quantities : 
PIO 


_where P,’ and Q,’ are the components of the same vectors 


124 RELATIVITY 


along the new axes; it can be shown that each of the sixteen 
new quantities is a linear function of the original sixteen. In 


LEYS 
Py'Qu = QobupP Qe, 


which summarises sixteen equations with sixteen terms on the 
right-hand side of each (for we sum for each of the two dummy 
suffixes a and f). 

Two properties of such a group of sixteen quantities are 
important. 

1°. If R is an arbitrary four-vector, and the constituents 
of each row of the array are multiplied by Ry, Re, Rs, Ry 
respectively, and the four terms of each row added, we 
obtain four quantities, which are the components of a four- 
vector, i.e., are related with the transformed four by equations 
similar to (I) and (6) of Chapter V. Thus, in unaccented letters 
the quantities are 


(Oalea le Pte on ater eco) sajhl Oates ergs 
With accented letters they are 
(ORE une iar wae een Chacha) Rear 


But 
Q.R, = QR,’ =(Q.R). 


Hence in each case we obtain the components of the vector 
P(Q. R). 

A similar theorem is true if we substitute ‘‘ column ”’ for 
“row ’’ in the enunciation, the vector in this case being 


Q(P . R). 


« 


2°. If instead of multiplying by the components of a four- 
vector we operate in a similar fashion with the components 9/dx, 
of the vector operator, we obtain a four-vector; for since 
0Q,/2%, = Q.'/d%.’ = Div Q the results are the components 


P,Q0900): . oa) ee PO ee 


of the four-vector P Div Q. 

Any set of sixteen quantities which are transformed like the 
sixteen quantities P,Q, constitute a ‘‘ tensor of the second 
order.”” We denote such a tensor by double suffixed symbols, 
such as P,,, Qy,, Ry,, etc., or by symbols’such as [P, Q, 1R. A 
four-vector P, or P is a tensor of the first order. The sixteen 
components of P,, are denoted by 
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Py Pre P43 Pr 


Pu Pas P43 Pay 
and the fundamental transformation equation for a tensor of 


the second order is 
, —_ 
P= Qfheh ap: 


By means of this equation and equations (2)-(5) of the pre- 
vious chapter, it is not difficult to prove these two theorems for 
any tensor of the second order, viz., 

T°. Py1Q1 + P1202 + PisQ3 + PiaQ4, and three similar ex- 
pressions (or as we may succinctly write them, P,,Q,), are 
components of a four-vector. We denote it by the symbol 
(Q. Pj. 

2°. dP,,/d¥, are components of a four-vector. Since each 
component of this vector resembles a divergence, this vector is 
sometimes called a “ vector-divergence’”’ and written Aiv P, 
the Greek A being used to distinguish it from the scalar diver- 
gences of three- and four-vectors, div and Div. It ought clearly 
to be understood, however, that any one of the components, say, 


OP 44/d%1 + OP y9/d%_ + IPy3/d%3 + OP yy/d%q, 


is not a scalar divergence, for P,,, P42, P13, Py, are not com- 
ponents of an imvariant vector. Py, P42, P43, Py, could, of 
course, be regarded as components of a vector, but it is not the 
same vector which has components P,,’, Py,’, P43’, Py,’ with 
respect to the altered axes, because the relations between the 
P,,’ and P,, are given above, and are not 


Py = @yPy, + «~~. + 414P ay, ete. 


To avoid any misapprehension of this sort, it is as well to use 
another name for the four-vector dP,,/d%,, and it is frequently 
referred to, especially in English books, as the “ Lorentzian of 
[P,” and written Lor Pp. 


SYMMETRIC AND ANTI-SYMMETRIC TENSORS: SIX-VECTORS. 


If a tensor of the second order has components which satisfy 
the relations ; 
Py Px , 
it is called ‘“‘ symmetric.” 
If its components satisfy the relations 
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1 oh ed a 


pe 


it is called “ antisymmetric.” This involves the condition that 
Py, =P,,=—P;, =Pa= 0, 1.65, the leading ~~ components 
of an anti- -symmetric tensor are all zero. 

If P and Q are two vectors, then P,Q, — P,Q, constitute 
the components of an anti-symmetric tensor. 

An anti-symmetric tensor of the second order contains only 
six numerically different components, and selecting the six 


ena Ps, Ps, Pig Pay Poe 


these are frequently referred to as the six components of a 
“ six-vector.”’ The reason for the name is based on the fact 
that if one takes (%1, %s, %3, ¥4) and (74, Vo, V3, Va) a8 the co-ordi- 
nates of a pair of points A and B in four dimensions, then 


X%eV3 — %3¥o, %3¥1 — %1V3, %1V2 — %241; 
XyVqa— %aV1, %2Va — %4V2, %3V4 — XaVap 


are respectively double the areas of the projections of the 
triangle OAB on the sx co-ordinate planes OX,X;5, OX;X,, 
OX,X., OX,X,, OX,X,, OX;3X,4. In three-dimensional geo- 
metry corresponding expressions (three in number) are known 
to constitute the components of an invariant three-vector 
(“‘ axial ’’ vector), whose magnitude is double the area of the 
triangle OAB, and whose direction is normal to its plane OAB. 
By analogy the six projections above are referred to as com- 
ponents of a six-vector, whose magnitude is double the area of 
the triangle OAB, and whose “ orientation ”’ is the plane of the 
triangle. (As in four dimensions there is an infinity of lines 
perpendicular to a plane at any point of that plane, and as 
all these lines lie in another plane intersecting the former 
plane in the one point only, we cannot speak of the “ direc- 
tion’ of asix-vector.) In LSP P and Q being four-vectors, 
PLO; — PQs - + +» - - Sr a) Pood SSA 8) a the 
six components of a six- -vector ; and since P,Q, — P,Q, is 
an anti-symmetric tensor, we can also say that if [ is any 
anti-symmetric tensor, six of its components properly selected 
constitute a six-vector, P,, being the component of the six- 
vector in the plane Oxo etc. Asa matter of fact, the sixteen 
equations of transformation for a tensor of the second order, 
with sixteen terms on the right-hand side of each, reduce, in 
the case of an anti-symmetric tensor to six equations with six 
terms on the right-hand side of each. For example, 
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, 
Py’ = AyaAyph. ap 
— Oo, 
since 
PS Pend P= 0 
Also 
£ 
Py = Ayatyuph op a Ay petrol pa —- - Poe 


So the anti-symmetry is preserved after transformation, and 
we can write those equations of transformation, which are not 
either zero identities or duplicates of others, in the form 


A A r A d r 
Py, = a Pos iene 31 Aug Pyst a Pyst Si Past (Mt Ue 
where | on is a contraction for @,,@,, — @,,@, and Au is re- 


placed consecutively by 23, 31, 12, 14, 24, 34 in order to give 
the six equations. 
It will be convenient for the moment to write the thirty-six 


coefficients Ms Be Aare ino rg: Ubis & Acs Cees on Oe 
so that, for example, 

23 2 it 
=e bis = Wi ba = 2 oe b 


The equations just written are then 


Bi’ = bi Di = Drs P; ss Dis Ps a Dxa Ba, “i Das Bs =“ Dig Pg (I0A) 


=: b,,[P, (summing for a from 1 to 6 


By Theorem III., of Chapter V., concerning the second 
minors of the determinant of the a-coefficients, it is easily seen 
that if we write the 36 b-coefficients in a similar square array, 
the constituent in row 7 and column s is equal to the con- 
stituent in the row (vy + 3), and column (s + 3) (where 7 and 
s can be replaced individually by 1, 2, or 3). Making use of 
this, we can see that the transformation equations for a six- 


vector are equivalent to 


Bp.’ = 0,,D. + O:Ms + S156 + Gra. + F152 + Fics 
By = = SelB 4 ie bal, e bale + Baas + Bas + Bal 


a a bee == eS | etc, 


Bs’ = buds + bel; + besle + Deals + F62lh2 + FslPs. 
Which proves the proposition that if 
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Is P31, Pas, Weare Pag Prxq 


are the components of a six-vector in the planes OX,Xz, . . 
OX,X,, respectively, then 


1a gas Ps Tas Poy Pr. 


<9 


are the components in the planes OX,X,, . . ., OX,X, re- 
spectively of another six-vector. 

The second six-vector is said to be “reciprocal to,” or 
‘associated with,” the first. 

The six-vector P,Q, — P,Q, is called the vector product of 
the two four-vectors P and Q and is generally represented 
symbolically by [P. Q]. 

In particular, the six quantities 


wW,,/0% — IP,/d%,, 


which constitute the symbolic vector product of the four-vector 
operator 90/d%, and the four-vector P,, form a six-vector. It 
is called the Curl or Rotation of P, and we write it 


Curl Por Rotse, 


capital C or R being used to prevent confusion with the well- 
known three-dimensional curl or rotation. (The symbol Rot is 
frequently used in continental works. English authors generally 
prefer Curl, following Heaviside’s example.) 

The square array of the thirty-six b-coefficients above can 
be shown to have properties similar to some of those possessed 
by the array of the sixteen a-coefficients. 


Thus 
O80 a = Na 
=oifA+upu 
O50; sk 
= OTA 


(the summation for a being from I to 6). 
By reason of these we can prove that 


Pp.’ BP.’ = DB. B.- 


This invariant is therefore the square of the magnitude of the 
six-vector [P. 


If [P and iR are two six-vectors, it can be shown likewise that 
P.R, = P.'R.’, 


i.e., [PIR is invariant. The sum of these six terms is referred 
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to as the “scalar”’ or “‘ geometric ”’ product of [P and 1, and 
can be denoted by ([P . 1k). 

One more result is required for the purposes of Relativity. 
It refers to a tensor derived from a six-vector Ff, as follows: 
Let F, be a component of F, and R,, be the corresponding 
component of the reciprocal six-vector, so that R,, = F 


14) 
.» Rig = F23. The tensor in question is 


Dy = FF ua — RK, (the summation for a is from 1 to 4). 


That T,, is a tensor can be proved most readily by working 
out the values for the four quantities 97 j/d%,. It will be found 
that 


IT 9/ De = Py. dF gal d%Xp 2 es Ry.dRgo/d%Xp 


= Acta” Aa ay 


where P and Q are the four-vectors Lor J and Lor iR respec- 
tively. 

Since F,, P, and R,, Q, are the components of the four-vector 
products [P. Jf] and [Q. 1R}, it follows that dT, /d%, are com- 
ponents of a four-vector. We know that if U,, is a tensor of 
the second order, then dU,,/dx, is a four-vector; and the 
converse is equally true and not difficult to prove. Hence T), 
is a tensor of the second order. It is a symmetric tensor ; 
moreover, it is a special type of symmetric tensor, for on 
examination it will be found that the sum of its leading 
components, 

: De gs re) ng a4 
is zero. 

Having dealt with the vectorial material which is necessary 
for our future progress, we can recapitulate what that material 
is. It consists in the main of the following nine functions and 
their properties : 

(x) The invariant scalar product of two four-vectors, (P. Q). 

(2) The invariant scalar product of two six-vectors, ([D.@). 

(3) The six-vector product of two four-vectors, [P. Q]. 

(4) The four-vector product of a four-vector and a tensor of 
the second order, and in particular of a four-vector and a six- 
vector, [P. P]. 

(5) The tensor derived from a six-vector and its reciprocal. 

(6) The four-vector Gradient of a scalar function. 

(7) The scalar Divergence of a four-vector function. _ 

(8) The six-vector Curl or Rotation of a four-vector function. 

(9) The four-vector Lorentzian of a tensor of the second 
order, and in particular of a six-vector. 

2 
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EQUATIONS OF THE ELECTROMAGNETIC FIELD. 


The essential feature of these vectors and tensors is the 
existence of linear relations (with constant coefficients) which 
connect the transformed components with the originals. Hence, 
if the laws embodying physical results in a definite frame can 
be expressed in equations between such tensor functions, the 
equalities are preserved after a general Lorentz transformation, 
i.e., the laws are true for physical results in any other frame 
moving with a uniform velocity relative to the first. This 
tensor analysis is the natural mathematical medium for expres- 
sing laws of nature in the invariant form required by the first 
postulate of Relativity, and we have seen how it can be employed 
in the treatment of kinematics and the laws of motion. We 
now pass on to the treatment of the electromagnetic field and 
other problems arising out of it. 

In order to avoid confusion in our symbolism between 
three-vectors and four-vectors, we will keep small Clarendon 
and italic type for the former, and capital Clarendon and 
italic type for the latter. Thus the electric vector will be 
denoted by e, components e,, é,, e¢, ; the magnetic vector by h, 
components hy, hy, h,; velocity of electrified matter at a point 
by Vv, components vy, vy, v;; and electric density by p. 

The equations of the field are 


curl h — d3e/d¢ = pv (x) 
dive =p : : 2) 
curl e -+ dh/% =o : ae} 
divh—o : (4) 
or, in Cartesian form, 

dhz/ dy — dy/dz2 — d€,/d¢ = py = (ER) 

and two similar equations, 
d€x/d% + de,/dy + de,/d2 = O : ees) 
dé,/dY — dey/dz + dhy/d¢t = O ~» (38) 

and two similar equations, 
dAyz/d" + hy/dy + dh,/02 =O : - (4A) 


Let-us White 45, 4e:.vept, 1OF yee 
F,, for hz, Fs, for hy, Fy, for h,, 
F,, for — vex, Fy, for — vey, Fy, for — ve, 


A little trouble will convince one that equations (x) and (2) 
become the symmetrical group of four: 
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OF /9%y + IWF yg/I%q + IF 3 /Ixg + IF 4/K4 = pr, (5) 


where A is made I, 2, 3; 4 in succession and (vj, V9, U3, U4) = 
(vz, Vy, Vz, -), provided we make 


By = Fog = Fa, = Fy = 0, 
and assume 


Fy = =F 


we 


If the four expressions on the left-hand side were the com- 
ponents of a four-vector, and likewise the expressions on the 
right-hand side, the principle of Relativity would be satisfied as 
far as equations (I) and (2) are concerned; for the equality of 
two four-vectors is independent of the particular axes chosen in 
the World, i.e., of the particular frame chosen among the group 
of frames moving uniformly relative to one another in space. 
The first proviso is satisfied if Fy, is an anti-symmetric tensor 
or six-vector, for the left-hand side would then express the four 
components of its Lorentzian. So we could write these 
equations (5) succinctly in the forms 


PN gen od doe . (5A) 
or Lory J, (5B) 


where J,, J2, Jz, J, are the components of a four-vector, the 
“stream ’’ four-vector, and equal respectively to pvz, pvy, pvz, 
up. ¥F is usually called the “ field’ six-vector. 

Of course, the assumption that Jf is a six-vector (which is 
necessary, if the equations are to pass the restricted relativity 
test) means that if the values of h and e measured in another 
frame S’ are denoted by accented letters, then these six equa- 
tions are true: 


Tie a. th. OG) 


(aB going through the sequence 23, 3I, 12, 14, 24, 34 in the 
summation). This implies that any component of the electric 
or magnetic fields in S’ is a linear function of all the six com- 
ponents of the field in S, the coefficients being certain constants 
depending on the relative motion of the frames and the relative 
orientation of the axes in each frame. In Chapter IV. we 
obtained these relations for the simple Lorentz transformation ; 
in that case, certain of the coefficients Were zero, so the right- 
hand side did not contain the full number of six terms. 
Similarly, the assumption that J is a four-vector means 
that the components of the three-vectorial current density and 
9 *% 
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the scalar charge density, as measured in a frame S’, are 
connected by definite linear relations with the similar four 
quantities as measured in S. Thus 


p Vx = A,pVx + AyopVy + Ay 3pVz + ay, | ( 
a 


and two similar equations 
and tp’ == AgpVz + Agopy + Ay3pVz + Aaalp 


The relations obtained in Chapter IV. were the degenerate 
form of these for the simple Lorentz transformation. 

As a matter of fact, it is not difficult to throw equations (7), 
(8), and (9) of Chapter IV. into a three-vectorial form which is 
equivalent to (6) and (7) above. Thus equations (7) and (8) 
of Chapter IV. become (after making the change from large 
letters to small) : 

’=afe + [u.h]}—(a—1)(u. emis 6 
h’ oth [a el} = (a= iu. bel 


Equations (9) of Chapter V. become : 
pV’ = pv — apt + (a — I)p(u. hia 
p’ = afp — p(u . v)} 


(Compare equations (E) and (F) of Chapter V.) 
We have still to deal with the equations (3) and (4). It will 
be found that they can be written as four equations in the form, 


WF y,[d4, + Fya[d%, + Fy,f2%,=0, . (8) 


where we write for A, p, v: 


(74) 


2, 3, 4 in the case of the first equation ; 


eed anes $5 ur a RSCCONG pars 
Danae hs, * Pa esitds) ae 
Ts 2, S +” ” ” fourth 2) 


But for our immediate purpose, if we consider the anti- 
symmetric tensor reciprocal to F,,, viz., R,,, where R 
etc.; R,, = 0, etc., we can write (6) in the form: 


23 — 444, 
ORO 210 j i . (8A) 
or, using the six-vector reciprocal to f, in the form 
Lor Riso). : . (8B) 


Thus the equations (3) and (4), being expressible in vectorial 
form, on the same assumptions as before as to the relations 
between the field components in the frames S and S’, also pass 
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the Relativity test. In fine, the conditions for the relativity of 
the field equations are that f, i.e. (h, — ce) is a six-vector, and 
J, ie. (pv, w) is a four-vector. That is the succinct way of 
stating the equations (6) and (7) above. 


ELECTROMAGNETIC POTENTIAL. 


If we take any four-vector function A and obtain its Curl, 
i.e., 043/d”%_ — 04,/d%5, and five similar components, the result 
is a six-vector. To any six-vector we can apply the ‘“ Lorent- 
zian ”’ operator and obtain a four-vector. Now, although it is 
not in general true that the Lorentzian of Curl A is zero, it will 
be found on trial that the Lorentzian of the six-vector reciprocal 
to Curl A is zero. Now, since Lor f is zero (equation (8B)), 
where IR is reciprocal to Ff, the field vector, this suggests that 
a four-vector A exists such that 


Cor An, a} 
If this is so, then by (5), 
Lor Curl A. =J. : ; ST (TO) 


It will appear on examination that equation (9) is the six- 
vectorial form of the six three-vector equations : 


h— curl a 
e = -- grad ¢ — da/dz, 


Bieter gt A — Ay, A, — 0; Ay wp. 

So we have been really giving its general vectorial form 
to the results obtained as equations (10)-(12) of Chapter IV., 
where we demonstrated the “ cogrediency ”’ of the three com- 
ponents of the vector potential and the scalar potential, 
i.e., establishing the fact that the three components of the 
vector potential and the scalar potential (affected by +) con- 
stitute a four-vector. (Note that in Chapter IV. we were using 
the symbol A for the three-vector potential of the field, whereas 
we are now using a for that function, keeping A for the four- 
vector “‘ electromagnetic potential.’’) 

Equation (10) is a succinct four-vectorial form of two very 
well-known results concerning the vector and scalar potentials. 
Thus the first of the Cartesian equations in terms of space and 
time co-ordinates in the frame S, obtained by translating 30) 
into the more customary symbols, turns out to be 


d(div a + d4/d¢)/d” — [_|ax = prx, 
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where [_] is the ‘‘ Dalembertian ’’ operator 
32/dx2 + d2/dy2 + d2/dz2 — d2/d22. 


In fact, the first three equations obtained by translation of 
(10) are summarised in the three-vectorial form 


grad (div a + 0¢4/d¢) — [_ la = pv. 
The fourth of (10) turns out to be 
Ad + d(div a)/d¢ + p = 0, 
where A is the “ Laplacian’ operator 
02/0x? 07/0? -- 07/027. 


Hence if, as is generally assumed in electromagnetic theory, 
we subject the vector potential a to the restriction 


div a + 2d¢4/d¢ =: 0, 
we have proved that 


[ Ja + pV=0 
and [+p =9, 
or, in four-vectorial form, the four-vector A _ satisfies the 
condition 

[JA+J=0, . : oa 


where [_ | is still the ‘‘ Dalembertian ”’ operator but is now the 
symbol representing the operation 


07/d%47 + 92/dx_2 + 92/dx5? + 02/dx,2, 
being, in fact, the invariant ‘ 
operator 2/dx,. 

Equations (9), (10), (II) are interesting, not merely on 
account of the elegant brevity with which they express well- 
known equations of electromagnetic theory, but because, being 
in vectorial form, they demonstrate the relativity of these 
equations. 


“square”’ of the four-vector 


CONSERVATION OF CHARGE. 


It can easily be seen that the Scalar Divergence of the 
Lorentzian of any six-vector is identically zero, as the twelve 
terms cancel in pairs. Hence 
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Div (Lor F) = 0, 
and so by (5) 
Div J =o. 


In Cartesian form this is 


d(pvx)/dX + dI(pvy)/dIY + B(pvz)/dz + dIp/dt = 0, 
or div (pv) + p= 0, 


which is the mathematical expression of the fact that any 
increase or decrease of electric charge within a given region 
definitely marked out in the frame S is entirely accounted for 
by a flow inwards or outwards across the bounding surface, 
there being no creation or destruction of charge in the region. 
So conservation of charge is likewise an invariant relation, 
i.e., if true in one frame it is true in all. 


INVARIANT QUANTITIES. 


Since J is a four-vector, its magnitude is invariant ; this 
is the square root of p*(v,? + v,? + v,? — 1), or (ignoring 1) 
p(I — v?)t. As we saw in Chapter IV., this means that the 
charge on a definite body is estimated alike by observers in any 
frame. 

Also, the magnitude of F is invariant ; this is 


Ra —Se,?)t , 


or h? — e? is the same for all observers. 

The scalar product of two six-vectors is invariant, therefore 
(J . IR) is invariant. On examination, this turns out to be the 
invariance of the geometric product of the electric and magnetic 
three-vectors, viz., (h. e). 


RELATIVITY OF ELECTROMAGNETIC FORCE AND ITS 
ACTIVITY. 


We have seen that we can form a four-vector product of a 
four-vector and a six-vector. Let us form it for the stream 
vector J and the field vector Jf. The components of [J . ¥] 
are J,Py, + JoP ie + JaFis + Jaf 14, and three similar expres- 
sions. On examination the first reveals itself in Cartesian form 
as p(éy + vyh, — v,hy) ; and the next two complete the triad of 
compovients parallel to the axes in the frame S of the electro- 

magn tic force on unit volume of a charged body moving relative 
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to the frame with a velocity v, viz., p(e +-[V.h]). Let us call 
this pf, where f is the force on unit charge. The fourth com- 
ponent of [J . FF] is found to be «p(vyex + Vyey + Vz€z), which 
is identically equal to ip(vzf, + Vyfy + v,f,), since terms involving 
h,, hy, h, in the latter cancel in pairs. This fourth component 
is, therefore (ignoring +), the activity of the force per unit 
volume (v . f) being the activity of the force per unit charge. 
Now it has been established above that p/f is invariant (where 
B =1/,/(1 — v2). Hence it follows that if fis the force per 
unit charge on a body Bf;, Bfy, Bf, ~B(v . f) constitute the com- 
ponents of a four-vector, and since the charge on a body has 
been proved to be invariant, it appears that the electromagnetic 
force-components on a given charged body, and the force 
activity (affected by .) when each is multiplied by f constitute 
the components of a four-vector. We have seen a few pages 
back that this is the condition required to establish the relativity 
of the equations of motion. 

So far the interest in this four-dimensional vectorial analysis 
has lain in the elegant and succinct form it has given to the 
expression of a number of results concerning the relativity of 
equations which we already had discovered in connection with 
the simple Lorentz transformation. As a matter of fact, it is 
a mathematical method as powerful as it is elegant, and it is 
time to employ it in developing other results not dealt with so 
far, and in showing how by natural steps we are led into the 
treatment of the dynamics of a continuous medium. 

It was pointed out above that if Jf be a six-vector, then 
we can form a tensor of the second order with components 
PF \gFug — RygRua, Where WR is reciprocal to Ff. 

Take Ff to represent the field tensor and form the tensor 
(RyoRua — FyaF ua). To facilitate this, let us write down Ff 
and IR in full as anti-symmetric tensors : 


tole 


Lon 1S. 0 h, —hy, — ley 
—h, O hy — by 

lex ley lez O 

Ryu 1S O — lez ley hy, 
le, O age lex hy, 

ae ley lex oO Mes 


On working out $(RyaRua — Fyal ua) for all possibie values 
of A and uw, we obtain the tensor E),. Its constituents are 
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Pn = $(e,? as Gye — €,2 + n= hy? i 2) 
Fas = Fey? — €,2 — e,2 + h,? — h,? — h,?) 
Ess = $(e," — ¢7 — ey? + h,? — h,? — hy?) 
Eig, = $(e? + h*) 

Ey, = Cylx + hyh, = Es, 

E 3, = €,¢, + h,h;y= ya 

E,, = €xby = h, i= Ea 
Pec i 

Ey, aaa u(ehy €x)z) = Eas 
ay eh eB ,. 


On examination it will be seen that we have written down 
expressions for well-known concepts of the electromagnetic 
field. Thus 


Ey Ey, E13 
21 E oe 23 
31 E 3» E33 


are the nine components of the three-dimensional stress tensor 
of Maxwell; E,4, Ey4, E34, on omission of the factor —, are 
the densities of the components of the “ electromagnetic 
momentum ”’ of the field, viz., [e-h]; and Z,, is the energy 
density of the field.* 

With reference to a frame S, we can refer to the Maxwell 
(symmetrical) stress tensor as having the components 


bier bay bizg 
byx Lyy byz 
tex try bez 


where 7,, = £,,, etc. 

Also, we shall denote the momentum-density by g, so that 
g=l[e-h]; g, = ¢yh, — eh, =t E,,, etc. The energy density 
we denote by ¢, so that «is H4,. The energy-stream vector of 
Poynting we represent by 8, so that S is also [e. h]; sy =i E,.t 

Now. it is well known that in electromagnetic theory we 
have the following four equations connecting these quantities 
with the electromagnetic force and activity dealt with above, 


phe = Wye[d% + bxy/Iy + Vy2/d% — 82/4) 
and two similar equations, 22) 
and — p(v . f) = 0s,/d% + dsy/dy + d8,/dz + de/d ( 


* We are using Lorentz units; hence the non-appearance of the 
farailiar factor 47. Also we are using the relativity unit of time; 


hence the non-appearance of c. 
+ This apparent redundancy of symbols will be explained presently. 
, 
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These are mathematical equations which are true in virtue 
of the assumed truth of the equations of the field (1), (2), (3), (4) 
in the frame. Their particular form brings to the mind of the 
mathematical physicist who is familiar with them the idea of 
‘‘ conservation,” e.g., any of the first three can be written 


Wy/ dt = — pfy + deyx/d% + day/DY + deyz/d2 


(where A is replaced by 1, 2, 3 in succession), and these can be 
interpreted as follows. In the frame S there is a continuous 
medium possessing a momentum g per unit of volume. It is 
in a state of strain which is responsible for a stress whose 
components are ty, éry, etc. Then — pf is the magnitude of 
the “‘ body force”’ on it per unit of volume. For, as is well 
known, the equation then expresses the law that the increase 
of momentum per unit time within a surface surrounding a 
definite portion of the medium is equal to the sum of the forces 
on that part, due to the stresses exerted across the surface by 
the external part, and the body forces. In the days of the 
“ stagnant ether ”’ theory this was a most natural interpretation 
in view of the prevalent desire to reduce the explanation of all 
phenomena to the laws of motion; it was the truth of these 
equations and this natural desire which led to the introduction 
of purely mechanical terms, so that g, or [e.h] was called 
“electromagnetic momentum ;”’ #,,, etc., was called “ stress 
in the ether,’ and — pf was called “‘ force on the ether ;”’ for 
in this latter case, as pf was at all events force on electrified 
matter (supposed to be transmitted by the ether from other 
electrified matter), the opposite of this, viz., — pf, was regarded 
as the reactionary force of electrified matter on ether—an 
implicit appeal to the third law of motion. One difficulty, 
viz., that a strictly stagnant ether could hardly be said to 
possess momentum in the ordinary mechanical sense, was 
evaded by the assumption that the ether must relatively to 
ordinary matter be possessed of tremendous density (ideas such 
as this were very highly developed by Lodge), and so the 
necessary amounts of ‘‘ momentum,” or [e . h], per unit volume 
in any known fields could be easily allowed to exist with such 
minute velocities for the ether as to leave it “ practically 
immobile.” Similarly, the fourth equation of (12), being written 
as 


de/dt = — p(v-f) — divs, 


was interpreted as a “ conservation of energy ” law; this led 
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to the definition of « or $(e? + h*) as the “‘ electromagnetic 
energy ” per unit volume, and so the increase of energy within 
a given portion of the ethereal medium was regarded as being 
supplied by the activity — p(v.f) of the body force — pf per 
unit volume, together with a flow across the bounding surface 
of energy at the rate § per unit area per unit time. There is 
one matter which requires notice before passing on. The 
symbol [e.h] has appeared both as density of ‘‘ momentum ” 
and as “ energy-stream ”’ density ; this apparent contradiction 
of the facts of dimensions arises from our special unit of time. 
As stated before, to return to usual units, we must replace v 
by v/c and ¢ by ct, so that (12) would appear as 


phe = Ytyx/d"” — C7 10(eyh, — eghy)/d 


and two similar equations, 
and -- p(v.f) =cdiv[e.h] + de/dt, 


so that in Lorentz units of electromagnetic quantities, employing 
the second as the unit of time, the “ electromagnetic momen- 
tum ”’ per unit volume is c~1! [e. h] = g, while the “ Poynting 
vector’ of energy-current 8 density is c[e.h], or c?g, which, 
of course, satisfies dimensional requirements; for as [e.h] 
has the same dimensions as e? + h?, i.e., energy-density, 
therefore c~1{e.h] has dimensions of momentum-density (since 
c is a velocity) and c[e-h] has dimensions of flow of energy 
per sec. per unit area of cross-section of the energy radiation 
beam. 

These interpretations of equations (12) have lost some of 
their former importance in view of the irrelevance of the concept 
of an absolute space filled with an ether stagnant or “ practi- 
cally ’ stagnant. If it be desirable to retain the concept of an 
ether, so as to give the tyro in Physics a working model to 
correlate the phenomena of optics and electromagnetics with 
those of deformable media, then we must be prepared to allow 
every observer to ‘‘ carry his own ether about with him.” For 
our immediate purpose the interest in equations (12) centres 
in the fact of their relativity ; if they are true in the frame S, 
equations of the same form are true in S’, provided equations 
(6) and (7) form the basis for transformation of the measures 
of the physical quantities involved. We might assume this 
right away on the ground of the relativity of equations (I), (2), 
(3), (4), from which (12) can be derived; but the fact can be 
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most readily perceived when we appreciate that (12) can be 
written in the form 

OL ya) 0% = f Pla s ‘ $ ( 
or Doris — 13-5) : » “(128 


and can be demonstrated with little trouble. 


APPENDIX TO CHAPTER VI. 


ELECTROMAGNETIC PHENOMENA IN A MOVING MATERIAL 
MEDIUM. 


WE are now in a position to treat in a more complete fashion 
the subject dealt with in the appendix to Chapter IV. Just as 
we obtained in (6A) and (74) above the three-vectorial form of 
equations (7), (8), and (9) of that chapter, so we can also 
easily write down the three-vectorial form of equations (20)-(24) 
of the appendix. They are: 


e’ = afe + [u. b]}} — (a — 1)(u.. e)u/u? (13) 
dq’ = afd + [u. h]}} — (a — 1)(u. d)u/w? (14) 
h’ = afh — [u . d}} — (a — 1)(u.. h)u/u? (15) 
b’ = a{b — [u. e]} — (a —1)(u. b)u/w? (x6) 
J’ =j —apu+ (a —1)(u. juju? . (17) 
p’ = afp — (uw. j)}. ae (x8) 


Suppose now that the medium is at rest in the frame of 
reference S’, and that we assume the usual constitutive equa- 
tions for an isotropic medium at rest, viz., 


a’ -= <b 
b’ — ph’ 
j’ = oe’. 


The equations for the medium in motion with a uniform 
velocity u can be easily written down, assuming that the 
Relativity principle is true. Some simplification is possible if 
we remember that u is at right angles to [u.. a], where a is any 
vector, and so make use of the fact that (u.[u.a]) = 0. We 
first of all obtain 


d+{u.h}=ce+[u.b)) . ~ (29) 
b — [u. e] = p(h — [u. d)). : Ae 5) 


To obtain the constitutive equation for current and electric 
vectors we substitute in j’ = ce’ from (13) and (17). Then 
I4I 


142 RELATIVITY 


take the scalar product of u and each side of this equation ; 
this gives 
af(u.j) — p(u.u)} = o(u. e). 
Introduce this value for (u . j) in the left-hand side of the 
previous equation, and we obtain 
j — pu = oafe + [u.b] — (u.e)u}. 20) 


These constitutive equations (19), (20), (21), combined with 
the usual field equations 


dd/ot + j =curlh 
div d = p 
db/at = — curle 
div b = 0, 


satisfy the principle of Relativity for a medium which is moving 
with a uniform velocity in any of the frames of reference 
considered. 

These constitutive equations are known also not to be at 
variance with any experimental evidence to hand. 

Thus by (19) and (20), if we neglect the square of uw (and 
terms involving w? are inappreciable in the experiments) we 
have 

d=c«e+ (en—1){u.h]. : . ez 


This result has been verified by Wilson, as already mentioned 
in Chapter V.; at least, he has verified the factor « — 1, for 
in his experiment » was practically unity. 

We also obtain 


b=ph—(ew—r)[u.e]... (23) 


This equation implies that if an electrically polarised 
medium be in motion, then a magnetic field proportional to the 
vector product of the motion and field, and also to the factor 
eu — I, should be produced. Such a magnetic field was first 
detected by Réntgen,* and the motion of the polarised medium 
is therefore said to give rise to the Réntgen current. The 
verification of the factor e4 — I, or rather « — 1, was effected 
by Eichenwald.t 

As regards equation (21), its left-hand side is the conduction 


* « Berl. Sitz.”’ (1885), p. 195; ‘Wied. Ann.,” 35 (1888), p. 264; 
40 (1890), p. 93. 
+ ‘Ann. der Phys.,”’ 11 (1903), p. 421. 
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current ; for j represents the total current and pu is the convec- 
tion current. If we consider the quotient of its resolved part in 
any direction by the resolved part of the vector e + [u. b] as 
the conductivity of the medium in that direction from the 
standpoint of an observer relative to whom the medium has a 
velocity u, it appears that this conductivity has different values 
in different directions. Thus, in the direction of u, the value 
is c/a; in any direction at right angles to w it is oa. 

From equation (18) we see that even if p’ = 0, p is not zero 
if a conduction current is observed in the medium by those to 
whom it is at rest; for 


p =afp' + (u. j)}. 
In fact, if p’ = 0, then 


p = (u.j) 
= (u.j — pu) + p(u. u) 
= (u .1) + pw? 
and therefore pod), x : : we gtea) 


where i = j — pu, the conduction current in S. 

The existence of such a charge was first suspected by 
Budde,* and a formula of a similar type plays a part in Lorentz’ 
analysis. It is usually called the “ conduction”’ or “ compensa- 
tion”’ charge. Of course, if j’ = 0, we have p = ap’, and this 
leads, as shown already, to invariance of the charge in a de- 
finite part of the medium ; but it is clear that this invariance 
of charge in the macroscopic sense does not hold if conduction 
currents exist for observers in the rest-system. 

So far, the limitation that the medium moves with a uniform 
velocity in any of the frames of reference has been imposed on 
the analysis, but as Cunningham has pointed out in his book 
(Chapter X.), it is possible to treat the problem in a more 
general way and remove this restriction. Before doing so, 
however, it will be of interest at the moment, and serve for 
purpose of illustration later, if we indicate briefly two of the 
best-known methods of dealing with the question of moving 
media published before the enunciation of the Relativity prin- 
ciple and Minkowski’s application of it to this question. They 
are due to Hertz and Lorentz respectively. 

It is, of course, well known that the “ curl’ equations of the 
field are the differential form of the integral equations which 
express the laws of Ampére and Faraday in their most general 


* « Wied. Ann.,’’ 10, p. 553. 
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form, viz., 
d/d¢ Sha . a8) + fli. dS) = /(h . a8) 
— d/ot So . 48) = /(e . a8), 


where i is conduction current and it is understood that the 
surface S and its boundary s are fixed with reference to the 
axes. Now Hertz, regarding the ether as an absolutely fixed 
frame of reference, considered that when dealing with a surface 
S and its boundary s fixed in a body which is moving relative 
to the ether, one ought to replace 9/d¢ by D/d¢,* thus taking 
account of a change in d due to the motion of § and the (spatial) 
non-uniformity of d, as well as of the purely local time rate 
0/d¢. Now it can be shown that 


D/dt . //(a . dS) = //({oa/d¢ + Vv div a -- curl [v. a]}. dS), 


if v be the velocity of the body at the point where a given 
vector has the value a. Hence Hertz’ assumption leads 
readily to 


S/({ad/dt + v divd — curl [v . d] + i}. dS) 
= {(h. ds) =)// (eu) beds) 


by Stoke’s Theorem, and a similar equation for b. 
So that Hertz’ four equations of the field are 


dd/ot + i + pv = curl fh + [v. d] | 


div d — p 
b/d = — curlfe — [v. b]} - (25) 
div Dy= 0: | 


In framing his constitutive equations Hertz regarded the 
effective polarising electric intensity as the sum of e, and an 
intensity [V . b] due to the motion of the medium through the 
magnetic field b, and similarly that the effective polarising 
magnetic intensity is the sum of h and an intensity — [v . d], 


due to the motion of the medium through the electric field d. 
Thus he writes 


* If the reader is unfamiliar with the meaning of the operator D/dt, 
he will find it explained more fully in the succeeding chapter. Whereas 
d/d¢t refers to a rate of variation at an element of surface or volume 
fixed relative to the axes, D/dt refers to a rate of variation at an element 
moving relative to the axes, and possibly changing in size also. In 
fact, Do(%, y, 2, #)/t = dp/dt + (v . grad b) + > div v. 
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d = efe + . b)}) 
ean an alll me eee 


Now, quite apart from the question whether these equations 
satisfy the Relativity principle or not, it is certain that equations 
(26), which lead (to the first order in v) to 


d=«e+-[v.h] 
b=h—év.e] 


(u being practically unity), are contradicted by the experiments 
of Wilson and Eichenwald. 

Lorentz’ method was based on the theory of electrons and 
an averaging or “‘ smoothing out” process, in which he takes 
account of the motion of “free’’ electrons relative to the 
medium as the conduction current i and the motion of 
“bound ” electrons relative to the medium and of the medium 
relative to the ether as the convection current. Also, the 
“ displacement ”’ d is the sum of the field intensity e and the 
polarisation p. Of course, all these symbols refer now to 
macroscopic averaged values and not to the microscopic values, 
for which Lorentz assumes as his fundamental equations those 
written in (1), (2), (3), (4) of the chapter. The macroscopic 
equations obtained finally by Lorentz are: 


dd/o¢ + i+ pv = curlf{h + [v. p}} 


div'd =p 
db/d¢ = — curle (27) 
diy, D0. 


As his constitutive equations Lorentz takes 


d+([v.b] =«{e+[v.b] 
Balen ah a 28) 


assuming that the term [v.b] will affect both displacement and 
intensity, and similarly for the term — [v.d]. 

These constitutive equations of Lorentz do not contradict 
the Wilson and Eichenwald results. 

To investigate how far these equations of Hertz and Lorentz 
‘(or any others which have been suggested) agree with the 
Relativity principle (irrespective of their truth to nature), let 
us adopt Cunningham’s method of presenting Minkowski’s 
treatment. Taking for granted that there are five vectors, 
ax, eX, b*, h*, j*, which are connected by the four differential 
. equations in a given frame of reference: 

10 
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2d * /a¢t + j* = curl h* 


div<d* =p 
db* /d¢ = — curl ec 2 N20) 8 
div-b* =.0 


we wish to know if similar equations are valid in any other 
frame of reference moving with a uniform velocity with respect 
to the first. (Note that this puts no limitation on the movement 
of the medium relative to any of the frames.) From the treat- 
ment in the appendix to Chapter IV., and at the beginning 
of this appendix, the question is answered in the affirmative ; 
but we can drive the point home still more clearly and open 
up the way for a relativistic treatment of the constitutive 
equations by availing ourselves of the methods of Tensor 
Analysis. Thus let us consider the four six-vectors : 


Ti — h,*, hy*, POS? ae igs a hy*, aie ud,* 


= (D1) 
F, = (b*, — 10%) 
R, oor (e%, tb*) 


Re Sey (d*, ch*) 
and the four-vector 
J = (J, ¢p). 


Equations (29) can be written in the form 


Loree. 
Lorik, = a (30) 
which justifies their claim to agreement with Relativity, pro- 
vided ff,, etc., are six-vectors and J a four-vector, i.e., provided 
there are the usual linear relations between fields, current and 
density as measured in two frames of reference in uniform 
relative motion. 

In order to construct relativistic constitutive equations, let 
us consider the following four-vector products of the four- 
vector : 


V = Bly, ») 
and the field six-vectors, 7, etc. 


_ * We write the small cross x after each symbol so as to avoid any 
implication at the outset that it represents the same physical vector 
as the corresponding symbol does in any other treatment. 
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D=([V.F,] = A{d* + [v. h*], uv. 
E=[V.F,] = Aie* + [v. b*], ov 
B = [V.i,] = A{b* — [v.e*], dv. 
H =[V.i,] = efh* — [v. dx], uv. 


Now any linear relation between D and E, or between B 
and H would, of course, be a constitutive equation which 
would satisfy the principle of Relativity. As to its truth to 
nature, that would be a matter for experiment ; but as a first 
attempt it is obviously natural to write down one which 
degenerates to the usual type for a medium at rest in a frame. 
Such a pair are clearly 

D=c«E \ 


and B= pH (31) 


In terms of the field three-vectors these can be seen to be 
equivalent to 


dx + [v-h*]=e{e% +. De 
b* — [v. e*] Sayre == [Vara-=|: ; 


for these imply that 


(32) 


(v.d*) =«(v. e%) 
and (Vv. b*)=yp(v.h%), 


and so comply with the complete equalities in (31). 
For the third constitutive relation it would seem natural to 


write 
J = ck, 


but since (V.E) =o, this would necessitate (V.J) =o, 
which is not generally true. We can get round this difficulty 
by resolving J (four-dimensionally) into two components 
parallel and perpendicular to V, thus: 


J=1+AvV, 


where (V.I) =o, and A is a multiplier to be determined. 


Since (V . I) =o by hypothesis, 
Vie J AV . Vv) 
= A(B2v — 0°) 


* We write these as D, B, etc., in accordance with our convention 
for symbols representing four-vectors. They must not, of course, be 
- confused with the field three-vectors. 


Io * 
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Also (V.J) is an invariant, so we can calculate its value 
by referring to a frame in which the point in question is at rest, 
in which case V = (0, 0, 0, «) and J = (Jor, Joys Foz Po), pu being 
the “ proper ’’ or “ rest’ density of charge. 


Hence 
Ve J) p, 
and so 
Apa 
Hence 
J=I+ pV. 


We can now assume for the third constitutive relation 
I=J—p.V =cE. 
This leads to the three-vectorial equation 


i* — Bev = Bofe* + [Vv . b*]}, 
and also to 
ip == tBa(V .e*) + tBpo, 
so that 
BpoV = pv — Bo(v - eX) Vv. 


Hence the assumed constitutive equation becomes finally 
1% pV = Bales iys D4) — (v 265) vi) See 


We are now in a position to test the theories of Hertz and 
Lorentz, from the point of view of their ability to pass the 
Relativity test. In the case of Hertz’ equations, we write 


a= d= 
De be 
h+[v.d] =h* 
(2 Pee hoe) anes 
j=) pV 
and thus the field-equations of Hertz are relativistic. As 
regards the constitutive equations, however, these ought to be 


d + [v.hj + [v [v. d]] =ee, 
or to the first order in v, 
d+ ([v.h] = ee. 


Such a constitutive equation would comply with Relativity, 
but it does not agree with experiment (any more than Hertz’ 
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own equation, which is not even relativistic), at least if e is 

interpreted in the usual way, viz., that the difference of electric 

potential between two points A and B is measured by /(e . ds). 
AB 


Similarly a second relativistic constitutive equation suitable 
for Hertz’ equations would be 


b — [v.e] + [v.[v- b]] = zh, 
or b—(v.e)=pzh 


(to the first order), to which similar remarks apply. 
To test Lorentz’ equations we put 


a a. 
ere 
b= b* 
and h + [v.p] = h* (where p = d — e) 
j=1- pv = j*. 


As before the field equations are clearly relativistic. The 
relativistic constitutive equations derived from (31) are 


ay Be Vel) <0 ve. Di; 
and b — [v.e] = p{h + [v. p] — [v. d]} 
= p{h — [v . e]}, 
or to the first order 
d = <{e + [v. b]}-—[v-h] 
and b = vh — (u — 1)[vV . e], 


with which Lorentz’ equations agree to the order required, and 
on the assumption that yp is practically unity. 

Of course, still more general linear relations could have been 
assumed in the first instance instead of (31), and would certainly 
have to be assumed in the case of eolotropic media. The 
relativistic course of procedure would be to write 


D— Aer, Ei, 


where « would be now regarded as a tensor-operator of the 
second order. This means that we write 


Dy = €,F , + €yoo + €y3H3 + aay 


om ee, a Egol’ » = €q3ls + Egil g 
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and assume that the e,, transform by the relations 
yu’ = Fa up ews 
which would ensure that in a new frame 
Dy =e,Fy +8; +e Fs +1421, 
Pathe subject matter of this appendix is an excellent illus- 
tration of the method of Relativity, not as a means of actually 


giving a correct equation or law, but as a test between different 
alternatives. 


CHAPTER VII. 
RELATIVITY AND THE CONSERVATION LAWS. 


THE relativity of equation (12) of the last chapter, and the 
attempt to interpret it in terms of purely mechanical concepts 
such as “momentum,” “ force,” “‘ energy,” naturally leads us 
to enquire as to the possible relativity of similar relations 
(i.e., conservation laws) in strictly mechanical phenomena. 
We have worked out the conditions which ‘‘ mass” and 
“force ’’ must be subject to, in order to satisfy the postulate 
of Relativity in particle dynamics, where bodies are treated as 
indivisible things, whose rotation or internal condition is of no 
immediate importance. It is an obvious extension of our work 
to consider the bearing of Relativity on the dynamics of con- 
tinuous media, where internal state is considered. (In Rela- 
tivity mechanics we cannot speak of ‘‘ Dynamics of Rigid 
Bodies,’’ as the concept of rigidity or undeformability must 
disappear in view of the alteration of size and shape which 
accompanies an alteration of velocity relative to any chosen 
frame.) It is here that equations (12) can render us some 
assistance. At the time of their discovery it was quite natural 
to attempt to interpret them mechanically; now we may 
reverse the process and use them to suggest to us the form in 
which the dynamical equations for continuous media should be 
exhibited so that their relativity may be most directly demon- 
strated. 

We know that the laws of conservation of momentum and 
energy, when expressed analytically, give a comprehensive basis 
for general dynamics. We must be prepared in Relativity 
dynamics for the identification of mass and energy already 
suggested by the dynamics of a particle, and also for variation 
of the mass (and energy) of portions of matter which we have 
been in the habit of segregating as discrete bodies, bounded by 
conceptual surfaces, which are the loct of points representing 
presumably identifiable particles. The results just obtained in 
electromagnetic theory are bound to react on our ideas and 
invite us to test whether the principle of Relativity does in any 
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way call into question conservation of energy and momentum, 
in the sense that if demonstrated as true in one frame of refer- 
ence it might not be true in another. In discussing the laws of 
motion for continuous media, we are accustomed to make use 
of the concept of the stress at a point in the medium, analysed 
into nine components : 


byx bry bye 
by tyy bys 
tee tyy beg 


where ty, is the tension per unit area exerted across a plane 
perpendicular to OX in the direction OX, ¢,, is the shearing 
stress per unit area exerted across a plane perpendicular to 
OY in the direction OX, ¢t,,, across a plane perpendicular to 
OX in the direction OY, etc. This is, in fact, the three-dimen- 
sional ‘‘ stress-tensor ’’ well known to readers of text-books of 
Elasticity. When the medium is in motion in a frame S, each 
portion of the medium possesses momentum and also energy. 
So we can speak of a “ density of momentum ”’ g at a point 
in the sense that if 67 is an element of volume, containing the 
point, gé7 is its momentum; and of a “ density of energy ”’ 
ps, Where pdr is the energy of the volume element. (p57 may be 
kinetic energy, potential energy due to body forces, heat energy, 
or electromagnetic energy.) 

On such an element 57 there may also be a “ body-force ”’ 
exerted due to external causes and not arising from the stresses. 
This we will denote by kér, so that k is the force per unit 
volume of the medium or “ force-density.”’ 

It is shown in text-books of Elasticity and Hydrodynamics 
that the force on an element arising from the internal stresses 
is given by 

Sr(dtxx/d% + dItvy/Dy + dIty_/d2) 


in the direction OX and two similar expressions, so that the 
total force is given by 


Ot(ky + Ydtyx/d%) 


in the direction OX and two similar expressions. 

In Chapters III. and VI. it appeared that the verbal form 
of the second law of motion could still be retained in Relativity 
dynamics of a particle, provided the notions of mass and 
momentum were somewhat generalised, and the expressions for 
the force in different frames related to one another in a particular 


RELATIVITY AND THE CONSERVATION LAWS 153 


way. Our natural desire is to retain the law also in the 
dynamics of a continuous medium, and investigate the con- 
ditions to which our concepts of momentum, energy, and force 
must be subject in order to satisfy the Relativity principle. We 
have just written down the expression for the total force on an 
element of volume, and we must obtain the rate of change of 
momentum of the element. It will not do merely to write down 
6rdg/dt. That served our purpose in equation (12); but, as 
was pointed out, the mechanical interpretation of (12) involved 
the assumption that the ether was practically immobile. But 
we are now dealing with a moving medium, and $7dg/d¢ 
would refer to the rate of change of momentum within an 
unchanging element of volume fixed in S, since 2/d¢ implies 
differentiation with regard to #, the co-ordinates x, y, z remaining 
constant. Actually 67 is moving in S and generally altering in 
size. What we want is d(gér)/dt, taking account of change 
of position and size on the part of 87. One portion of this is 
d7dg/d¢t. A second part, due to flow of momentum into 67 
across its surface, is given by 


67 (Vz d8x/0% + Vydgx/2 + V,d8%/d2) 
and two similar expressions, or 
67(V . grad g,), d7(V . grad gy), 57(V . grad g,). 
A third part is due to the change of size in 67, and is given by 
Sr z(dvz/d% + dvy/dXy + dv,/dz) 
and two similar expressions ; or 
drg div Vv. 


Adding together the three sets of expressions and equating 
them to the total force on the element 57, we obtain 


Ry + > (d¢xx/d%) = Dg,/d, 
and two similar equations where the operator D/d¢ is defined 
by the equation 
D(x, y, 2, t)/d¢ = df/d¢ + (V. grad ¢) + ddivV 
= d¢/d¢ + div (¢V). 


In the same way the conservation of energy is expressed 
by equating the sum of the activities of the body forces and the 
stresses on the element 57 to the rate at which the energy in 
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the moving element is changing. This equation is 


(v . kK) + D{d(Ustex + Vytye + Vitex) / dx} = Dp /dt. 


It will be a little more convenient to introduce normal and 
shearing components of pressure rather than tension and write 
Dun = — lex, Piy = — ty, ete. it should aléo.be observed that 
ViPxx + VyPyx + Uehzx, and two similar expressions, are the com- 
ponents of a three-vector, viz., the vector-product of the vector 
v and the stress tensor p, which we can represent by the symbol 
[v.p]. It is not very troublesome to establish that the 
equations just obtained can be written in the form 


Lor ce is. 
where T represents the array of sixteen quantities : 


LFy3= Pac tfn T ye=Prytfevy, LT ys=Piezt 2x02, Tl 


To1=Pyet8yx, - 24 Sy 
31 Pu t+ Bix, , ; ; : TT (x) 
Ty= (uv,+[V : PJ), . . : © Lya= —_—- 
= y 
and 
TG le ay gee 7 a) 


while, as before; 43,4, %., %_.7Téplace:x, v.25 ch 

We have thus arrived at the conclusion that if the general 
dynamics of bodies is to be included within the scope of the 
Relativity principle, the quantities 7), are to transform as 
a tensor of the second order, and K, are to transform as a 
four-vector. Before proceeding to develop this conclusion still 
further, let us see if there is any partial support for it in our 
earlier dynamics. 

Suppose we consider a body in an unstrained state, so that 
for it ¢,y, etc., are zero. Such a body could be regarded asa 
collection of particles with no cohesive forces between them, 
which would travel over exactly similar paths under the in- 
fluence of exactly equal external forces on each particle, no 
internal stresses being required to prevent the body disinte- 
grating. It could be regarded simply as a large particle. Let 
fo be the mass of unit volume of it when at rest in a frame of 
reference. If referred to a frame in which the body has a 
velocity v, the previous unit volume becomes 1/8, and the 
mass of this reduced volume is now pf, so that its mass per 
unit volume is now poh?. Hence for such a body : 
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& = Puyv 
€ = Puy 
and the quantities T), become 
Bi iodx? B2gVxVy B* gad: (Pugs 
Bipodyix B®ugdy” B2gVyUz (BU dy 
B fholsx B'day Begs" 1B ugv 
LBV x Bug y LB? L9dz — B'uo 


Now we have seen that fv,, Buy, Bu,, oB constitute a four- 
vector, and so, by reference to Chapter VI., we find that the 
sixteen quantities just written down form a tensor of the second 
order, being the tensor product of two vectors, viz., Buz, etc., 
and pyBv,z, etc. ; for wp is invariant. Further, if f is the force 
acting on the body and 7 its “ proper’”’ volume, so that 7/f is 
its volume when its velocity is v, then 


k = ffyr. 


From the dynamics of a particle we know that ff;,, Bf,, 
Bf:, .8(v .f) are components of a four-vector. So also are 
y» Rz, (Vv. k), since 7 is a given constant. 
Thus in the case of an unstrained body we have some 
justification for assuming that T), and K, are a tensor and four- 
vector respectively. Indeed, we might find further support for 
this view, even in the case of a strained body by an appeal to 
the kinetic-molecular theory of matter. For we can regard 
pu, as a flow of x-momentum per unit time across unit area 
normal to OX, p,, as a flow of x-momentum per unit time across 
unit area normal to OY, etc., due to the chaotic thermal 
motion of the molecules with reference to axes fixed in the body. 
Thus g,v, + px, would be a total flow of momentum reckoned 
by giving to each molecule a velocity in the frame S com- 
pounded of v and its thermal velocity relative to axes fixed in 
the body. It would be the sum of a number of terms such as 
LyBy(vy)x<2 reckoned for the individual molecules. Similarly, 
Lxvy + Pxy would be Sy,f,(v,)x(v,)y, and so on. Regarding 
each term of these sums as coming under the conclusions 
arrived at for particle dynamics, we would conclude that the 
‘sums are components of a tensor; for if one adds corresponding 
components of a number of tensors, the result is a tensor. 

Turning from such special considerations let us apply the 
assumption that equations of conservation satisfy the Relativity 
test to the case where the motion of the medium is adiabatic. 
This means that with reference to an observer who is at rest 


a) 
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relative to a point in the medium, the momentum-density and 
the energy-current density at that point are zero, so that, for 
instance, there is no transfer of heat relative to him at that 
point. mo os 

Let S’ be a space frame in which the point in question is at 
rest, and at that point 


aig ae mae 
Sy by = hi 0: (3) 


Furthermore, by the ordinary mechanics of material media 
at rest we have at the point 


bys, = bey 5 bon = te ; bey = tye . . (4) 
Hence it appears that 
Dos Se Ts2' Tsy =T,;'; Ty: 


Tyy = T oq = T3y = Tay = Tae) = Ts) = a 

Hence the tensor has symmetrical components when 

expressed in accented co-ordinates. But symmetry is a pro- 

perty independent of axes. Therefore the tensor T), is sym- 
metrical in any co-ordinates. From this it follows that 


8x = Vx ae [Vv : Ply 
and two similar equations ; or, in vectorial rotation, 


g=rv+l[v.p]) a 


MOMENTUM AND VELOCITY ARE NOT IN GENERAL 
CO-DIRECTIONAL. 


The x-component of [v . p] represents the transference of 
energy of strain across unit area perpendicular to OX. Hence 
the right-hand side of (6) represents a flux of energy per unit 
area, and we see that it is accompanied by a density of momen- 
tum numerically equal to this flux in Relativity units. (To 
obtain the momentum density in C.G.S. units, we would as 
usual have to divide the right-hand side by c?.) This result is 
a wider aspect of the identification of mass and energy already 
obtained in our particle dynamics. In the special case of an 
unstrained medium, 


§ = vv. 


But, in general, as we may demonstrate from (6), g and v 
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have not the same direction. This can be shown most readily 
by using the simple Lorentz transformation, a small portion of 
the body being at rest in S’ but in motion relative to S, with a 
velocity u parallel to OX. In this case the scheme of ay, 
coefficients becomes 


a O oO tau 

0 18 O oO 

O oO I O 
— tau O oO a 


where a = (1 — #?)—4; and by (1), (3), and (5), 
Py = Pex’, Ty9 = Pry’, T33) = Pz’, Ty,’ = 0 
and two similar sets; and 
Sree = O; j As = O, fees! sd O;, Dan, —_ -— rigs 


Therefore 
Ga4pal op’ 

, 
@y14141 44 + GaGaal as 
La? 24 (pL + Dux 5 


Dong —— Aa2dpal ap 
= Ago 4T 9)’ 
= apy,’ 
and Lo, = wap... 
ies; ae au (pe i Pix’) | 
Sy = apy,’ ee me 
£2 = aupzz. 


Hence if the medium be strained the momentum density g is 
not in general directed along OX, i.e., along the direction of 
motion in S of the point in question. This will only occur if 
there be no stress, or if the stress components py,’ and ;,’ 
vanish, in which case the state of stress for the body at rest is 
one of uniform pressure in all directions or simple hydrostatic 
pressure. This difference of direction between velocity and 
momentum is important, as it offers an explanation in detail of 
the negative result of an experiment carried out by Trouton and 
Noble in 1903, which, had it proved successful, would have 
exhibited positive evidence of terrestrial motion through the 
ether. We shall deal with this experiment presently, but 
before doing so it will be of interest to deduce one more impor- 
tant result from the postulated relativity of our equations. 
If in the frame S’ the medium at rest be subject to a uniform 
hydrostatic pressure #’, then 
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and the remaining components are zero. So every component 
of T’y, is zero except four, viz., 
Ty,’ = Toy’ = T3;' = p’, and D gg lees 
Hence the equations of transformation give 


Ty = Wei ea! ae 
=hyi4qq] ay 4 Arab alas 
= af’ + a?ury’ 

1.€., Byt + dex = ap + ary’. 
But by (7), 
gx = auly! + f'), 
so that pic Gl 
Also To. = Aq 2T a8 
ie, by (1) by =P" 
similarly a = fp’. 


Proceeding in this way we can likewise prove that fy, 
etc., are zero. 

Hence if a medium, which is subject to a uniform hydrostatic 
pressure when referred to axes with regard to which it is at 
rest, be in uniform motion with regard to other axes, not only 
is it still subject to uniform pressure, but the measure of the 
pressure 1s tnvariant, or p = p’. 

Moreover, by (7), 

ge = ay! + p' 


and &y = &: = 0,7 


so that, as already stated, momentum and velocity have the 
same direction in this case. Considering a volume 7 of the 
body in S, its momentum parallel to OX is gyr, or a?(u’ + p')ru. 
But 7’, the volume of this part of the body in S’, is given by 
7’ ==at Hence the momentum is a(u’ + ’)7’u, or a(U’ + p'r’)u, 
where U’ is the internal energy of this part of the body in its 
rest frame. Hence for a mechanical interpretation of the 
momentum we are to regard the mass of the proper volume 7’ 
of the body as 
a(U" +. p'r’) 


when it has a velocity w. That is, we are to consider its proper 


mass as 
(Of! + pt : 
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This is the well-known Heat Function of Gibbs, and the con- 
clusion reached is in agreement with familiar notions concerning 
pressure, which regard it as an energy of strain per unit 
volume to be added to the usual internal energy, thermal, 
electric, etc. This result is an extension of the earlier result 
obtained for particle dynamics. 


TROUTON AND NOBLE’S EXPERIMENT. 


Returning now to the experiment of Trouton and Noble * 
their apparatus consisted of a condenser suspended with its plates 
vertical by a very fine phosphor bronze strip. The charges on 
the plates were passed into them by means of this wire and by 
a wire hung from beneath and dipping into a liquid terminal. 
The argument (suggested by Fitzgerald) was that if this con- 
denser were drifting with the earth’s motion through the 
ether, with the normal to its plates inclined to the direction of 
drift, then there should be a mechanical effect tending to turn 
the normal into this direction, and a rotation should be observed 
when the plates were charged and discharged. This experiment 
differed from the other experiments on ether-drift, in that it 
sought for a mechanical effect as distinct from an optical or 
electrical one, but like them proved to be negative in its result. 
Had it been otherwise, it would have provided an experimental 
contradiction of the postulates of Relativity enunciated two 
years later by Einstein. The mathematical basis for the 
expectation ran as follows. If there be a horizontal electric 
intensity between the plates X, the drift of this field through 
the ether with a resolved velocity “ at an angle 6 to the normal 
would give rise to a magnetic field in a vertical direction equal 
to Xu sin @. The energy of this magnetic field is 4 X*u? sin? 6 
per unit volume, and so the whole magnetic energy is Wu? sin? 6, 
where W is the total electrostatic energy. (This result must 
be slightly modified if we take account of a Lorentz-Fitzgerald 
contraction owing to motion through the ether, but the final 
result is not practically altered.) Since this magnetic energy 
(for constant charges on the condenser) is minimum for # = o, 
there should be a couple tending to turn the condenser to this 
orientation equal in magnitude to 


d(Wu sin? 6)/d0, 
1.e., to 
Ww? sin 20. 


Soe ia Dat se, oe hil. Lrans.,”” 202, p. 163. 
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Now, as a matter of fact, no rotation results form the exist- 
ence of this couple, and the explanation of its absence bears 
some analogy to the fact that in Relativity mechanics a force 
may not produce acceleration in a body provided its mass 
alters owing to loss or gain of energy by radiation, etc. We 
consider the condenser as at rest in a frame of reference S’ 
which is moving with a velocity « parallel to OX relative to 
the frame S (Fig. 5); the normal to the condenser makes 
an angle 6’ with OX’. (The angle it makes with OX in S, 
viz. 0, differs from 6’ by a quantity of the second order, but 
this introduces no practical error in the result. Then in S’ 


Ey =X cos 0) By = X-sin 0, 0. 


Ul 


vf iy 


Fie. 5. 


We regard the dielectric as being in a state of strain, and 
the stress components as given by Maxwell’s formule : 


ben aa (ce = | Barv = Te?) => 4X? cos 20’ 
tay’ == BE, = 4X? sin 260’ 
bred = y chad ae = O. 


Also, ys’, the energy density, = 4X2, and the components of 
g’, momentum density, are zero. 


Referring the condenser to S, we have by (7), 


& = aup'(I — cos 26’) 
= 2a7ujp' sin? 6’ 

&y = — aup’ sin 20’ 

&z = 0. 
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To obtain the total momentum of the condenser in S we 
must remember that an element of volume SxSy8z in S is equal 
to I/a of the corresponding element of volume in S’. 

Hence 

LT] gxdxdydz = 2a sin? 0 //ju'dx'dy'dz’ 
; = 2aWu sin? 6’, 

SV gydxdydz = — Wusin 26’, 

MV /adxdydz = o. 


If we calculate the moment of momentum about the axis 
OZ, it is equal to 


ST] (98x — %8y)dxdydz 
= 2yaW wv? sin? 6’ + xWu sin 26’, 


where ¥, y are the co-ordinates of O’ with respect to OX, OY. 
Since x is changing at the rate u per unit of time, but y 
is constant, it appears that the rate of change of angular 
momentum round OZ is equal to 


W u? sin 20’, 


which is just the value calculated above for the couple acting 
on the condenser. 

It will be seen, quite apart from this detailed analysis, that 
the existence of a couple without accelerated rotation depends 
on the fact that the momentum has a direction different from 
that of the velocity, whereby a change in the angular momentum 
is produced not by a change in the momentum, but by a change 
in the arm of the moment of this momentum caused by the 
motion. Thus, in the figure, the line along which the resultant 
momentum is directed is O’P’, and the distance of O from this 
line is increasing (which would not be the case if it were O’X’), 
and so there is a right-handed alteration in the angular momen- 
tum about OZ, corresponding to the right-handed couple 
tending to diminish the magnetic energy as calculated earlier. 


RELATIVITY AND THERMODYNAMICS. 


It will be of interest before leaving these considerations 
concerning continuous media, to deal with one or two important 
conclusions in Thermodynamics. 

Suppose we have a thermodynamical system at rest in S’ 
with a uniform pressure p’, a volume V’, and a uniform absolute 
temperature T’, We have just seen that in the frame S the 

Te 
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system has a uniform pressure p equal to p’, and a volume V 
equal to V’/a. It is natural to ask what will be the effect of 
its motion on the measure of its temperature. If the system 
be a perfect gas, we can obtain an answer very simply and 
directly ; for from the law of Boyle and Charles 


pv’ =< TR 


where R is a constant which (if we adopt the kinetic-molecular 
hypothesis of matter) is equal to Nk, where N is the number of 
molecules in the body of gas considered, and & is the so-called 
“gas constant per molecule,” i.e., the quotient of the average 
energy of a molecule in a gas enclosed in a vessel (at rest in the 
frame of reference) divided by the absolute temperature 
(measured in this frame). Now & is a universal constant, and 
N, which is a number of discrete particles, is not altered by 
change of axes; hence R is invariant for the change of axes. 
But so also is #. Hence 


DN sce 
But if the Boyle-Charles law is to hold in S, 
DNeme be 
Therefore, since V = V’/a, it follows that 
‘eto 


or temperature is transformed like volume. In fact, aT is 
invariant, involving a decrease in the measure of the tempera- 
ture as the velocity of the system relative to the observer is: 
increased. 

This method of drawing the conclusion is, however, unsatis- 
factory. It only applies to a perfect gas, and even then, one 
has to introduce considerations involving the kinetic-molecular 
hypothesis, a procedure quite foreign to reasoning of a purely 
thermodynamic character. 

It is preferable to proceed as follows : 

From the second law of Thermodynamics we deduce that in 
an adiabatic, reversible change the entropy of a system does 
not alter. Now acceleration of the velocity of a system in its 
frame of reference by conservative mechanical forces is cer- 
tainly a reversible operation, and involves no transference of 
heat ; hence the entropy of the system is not altered. From 
this we conclude that the entropy of a thermodynamic system 
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at rest in S’ is equal to its entropy in S. In short, entropy is 
invariant. : 

If now the system has its entropy altered from ¢’ to ¢’ + 84’ 
by a reversible exchange of heat of amount 8Q’ with a source 
at the same temperature T’, we know that 


84’ = 8Q'/T", 


referring all measurements to S’. 
Referring measurements to S, we have 


aa , 
¢ + 06 = $' + bh 
and therefore dd = S¢’. 
But, according to now familiar ideas, the increase of energy 
(when measured in S) in the system is 60 where 
8Q = a8)’. 
At this point it is extremely easy to fall into a serious error 
by writing 
6¢ = 8Q/T, 
and so obtaining the false conclusion that 
f ree ge 


To be sure, the system acquires an increase of energy 
8Q(= adQ’) in S; but we must remember that this involves an 
increase in mass and therefore in momentum, since we postulate 
a uniform velocity uin S. Hence in S a mechanical force must 
be acting on the system equal to udQ/dt. This force will do 
work in time d¢ equal to 

uodl(u . dQ/dt) 


= “76Q. 


This will be part of the increase of energy, but being mechani- 
cal work must be subtracted from the total increase 6Q, to 
obtain the amount of heat transferred from the source to the 
system. This amount is therefore equal to 


8Q(1 — 22) 
== 60 Ja: 
dQ’ £ Ee P and dQ’ 
Hence aS d¢ = d¢ aw 
and therefore fa rat Be 


which is the correct result. 


PAT Sie 


GENERAL RELATIVITY. 


CHAPTER VIII. 


In the world of physical events, the conceptions of distance 
between points and interval between instants have been de- 
prived of the property of possessing absolute and invariant 
values which appertained to them in the space and time of 
the older Physics. They are now components (variable with 
the frame of reference) of a single invariant magnitude, 
separation between events. The manner of compounding the 
separation from its component time and space elements has 
already been indicated, but a certain limitation has been 
implicitly imposed on the choice of frames of reference in 
employing this method of composition. We have seen the 
difficulty of bringing gravitational force as a magnitude 
obeying the Relativity test into our scheme, and so it would 
appear to be a necessary assumption to postulate that the 
Relativity Mechanics and Electromagnetic theory which have 
been developed in the previous chapters are valid only 
for frames of reference in which gravitational action is absent, 
i.e., in which the rectilinear motion due to inertia is the most 
obvious feature and curvilinear paths do not exist, unless pro- 
duced by so-called mechanical actions such as pressures, pulls, 
impacts, etc., or by electromagnetic action. Now no such 
frames are available for us, unless we adopt the device indicated 
in the Introduction of attaching our axes to a minute portion 
of matter travelling in a natural orbit with the acceleration 
characteristic of its position with reference to the gravitating 
bodies which influence it; and even then gravitation is 
“removed ’”’ only for a minute portion of space around the 
origin, but not for a widely extended portion ; at some distance 
from this minute particle natural motion would be curvilinear 
in this frame also. This is so because, referred to our ordinary 
frames, the accelerations characteristic of the place where the 
particle is, and of any other place at a finite distance from it, 
are in general different. 

But although finitely-extended non- ee tonal frames do 
not exist, it is clear that we can imitate closely many of the 
mechanical effects of gravitation by conceiving the movements 
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in such a frame as referred to a second frame moving in some 
non-uniform manner with respect to the former. For example, 
such a conception would reproduce perfectly the most striking 
feature of gravitational action, the non-dependence of the 
acceleration on the mass or nature of the accelerated body, 
provided there is no resisting medium. This suggests, there- 
fore, that if we could develop a calculus which would gener- 
alise the method of transformation adopted in Chapter VI., and 
remove the restriction of linear equations of transformation, 
we should be in a position so to generalise the mathematical 
forms of the laws of Physics as to give them validity in gravi- 
tational frames of reference. 

As an initial step it would be natural to determine the new 
expression for the separation between two events. We can 
illustrate this by a few examples. Thus, suppose we had a 
non-gravitational or “‘ Galilean ” frame S’, and conceive another 
frame S to be rotating uniformly with respect to S’ around the 
axis of 2’ with a uniform angular velocity w. The familiar 
“‘joy-wheel”’ is an example of such a frame in which all the 
people sharing in the general rotatory motion of the wheel 
experience a centrifugal acceleration which carries them off 
the wheel precisely as if each one were subject to a repulsive 
gravitational force from the axis proportional to the square of 
his distance from the axis, and also (the most important point 
of all) to his mass. We could adopt as equations of transforma- 
tion (approximately true at all events) 


x’ = * Cos wt — y sin wi 
y’ =x sin wt + y cos wi 
Deane 
fait, 

So if two events occurred in S’ whose co-ordinates are 
(x’, y’, 2’, t’) and (x’ + dx’, . . . ¢’ + 82’), their co-ordinates in 
S would be (x, y, z, 4) and (w + 6x, . . . ¢-++ 82), where 

Of == OF COs wt — dy sin wt — w(x sin wt + y cos wt) dt 

os = ee sin wt ++ dy cos wt + w(% cos wt — y sin wt) dt 
2 OR 

of’ = Sf. 


Hence the separation between the two events as measured 
in S would be given by calculating 8¢’2 — dx’2 — Sy’? — 82’2 in 
terms of 6x, . . . dt. This turns out to be 


{[1—w?(x? +?) ]82 81? —8y? 822+ 2wydxdt—2uxdySt}t (1) 
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The separation between two events finitely separated would 
be obtained by an integration along a world line joining the 
events. As another illustration, consider the case where the 
inertial motion in a frame S’ is referred to a frame S which has 
a movement of uniform vest acceleration with regard to S’. The 
equations of transformation are 


x%' = % + g—1 cosh ki 
, Voge 


2 me 
is p sinh 7, 
for these equations give 
ger rk = At/2) 5 


and thus a point fixed in S (i.e., x = constant) has a uniform 
rest acceleration in S of amount g. (See equation (12) of 
Chapter IT.) 

The differentials are related by 


dx’ = 6% + kg sinh kt . dt 
py Oy O27 07 
ot = kg~+ cosh ki . ot, 


and so the square of an element of separation is 
hg 2622 — dx? — dy? — 82? — 2kg—1 sinh At. dxdt . (2) 


Another interesting example, due to Lorentz, consists in 
transforming from the inertial frame 5S’ to S by means of the 


equations, 
x cosh kt 


ee al em 
SU Pere seake 


x sinh Rt, 


, 


which implies that a point fixed in S has a rest acceleration in 
S’ whose value varies inversely as its x co-ordinates in S; for 


x2 — 7/2 = 42. 
The differential relations are 


dx’ = kx sinh kt . d¢ + cosh Rt . o7| 
Sie 0) 07 = OF . XS) 
8t’ = kx cosh kt . d6¢ + sinh kt. 5x) 


and thus the square of an element of separation is 


h2x?S22 — 8x2 — dy? — bz? : re Ae) 
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In general, whatever the nature of the relative motion of 
S to S’, the co-ordinates of an event in S would be connected 
with those of the same event in S’ by relations such as 


Paes AC ali) 
Bie = falx, ¥, a, t) 
z’ = f(x, y, Z, t) 
t’ = f(x, y, 2, t) 


where f,, fo, fs, f4 Would be functions depending on the relative 
motion of S and S’; and so the “ geometric ”’ field of gravita- 
tion existing in S could be said to depend on these four functions. 
The differential relations would be 


dx! = a,,8% + ay8y + a4382 + a, ,5t 
. . . . (5) 


Ol i= a4 ,d% oP a ady se Ay302 + Aq,dt 
where 
Gy, = 0} 4) 04, CLC tg, 0) 4 Ol ‘ . (6) 


The element of separation in S would be the square root of 


£ 110%? + SoodV® + £93027 + Bq 4d!” 
+ 2220x8y + 224,0%8z + 22, ,0x8t 
+ 2823002 + 22o45ydt + 2254525F : « 7) 


where 23,, etc., g44 are ten functions given by 
aon 2 
S11 = 441? — 441? — 421" — Ag, 


“9 2 > 3) 
Aga” — Aq? — Agq* — Agy 
Aq14 gq — @14449 — AgiAoq — 31439 


&44 
§12 


The change involved, therefore, in passing from an inertial 
frame of reference to one in which quasi-gravitational motion 
exists can be expressed by saying that the mathematical form 
for the square of an element of separation is a general quadratic 
function of the co-ordinate differentials instead of the special 
form with which we have been familiarised in the restricted 
theory, and that the ten coefficients are functions characteristic 
of the geometric field of gravitation. 

Now the hypothesis which Einstein makes is that there is 
an equivalence between the mathematical forms of physical 
laws in our actual frames of reference and in the accelerated 
frames introduced above. True, the equivalence is not per- 
fect ; indeed, Einstein’s law of gravitation can be considered 


I ll 
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as a definite indication of the point where the equivalence 
fails. This, however, will arise at a later stage; we shall for 
the present accept this ‘‘ Principle of Equivalence ”’ and proceed 
to develop its consequences when linked with the Principle of 
General Relativity, which require us to represent the laws of 
nature in such general mathematical forms that any transforma- 
tion of co-ordinates, however arbitrary, will not alter the form 
of these laws. As may naturally be expected, the laws in 
question are the laws of general dynamics, of electromagnetic 
theory, and of gravitation itself. 

As an illustration of this combination of Relativity with 
Equivalence, let us consider the “ natural’’ path of a “ free ”’ 
particle in S, i.e., a particle which has a uniform motion in S$’. 
We shall use in future the symbol s for the invariant separation 
between two events, so that * 


ds? = St’? — 8x'2 — dy’* — 62"2 
= g,0%7 + 2. . + 24488? 
+ 2g,,0ydz-+ .. . + 28535288. 


In the frame S’ the natural rectilinear motion of the particle 
is given by the equations of Chapter III., viz., 


hi = O04 == 0,2 = 0,0 =0 ~« °, (9) 


where the dots refer to total differentiation with respect to s. 

A very convenient method of transforming these equations 
to the variables x, y, z,¢ is that due to Lagrange, with which 
every student of general dynamics will be familiar. If we 
_write 2Z for the function, 


yy. Bad? + 2g) +... + 28524, 
it can be shown that the transformed equations are 


d(aL dx) /ds — aL [dx 
d(dL/dy)/ds — dL [dy 
d(dL/d2)/ds — dL/dz 
d(dL/dt)/ds — sL/dt 


(10) 


ol Wl 
Gio oo 


* In the preceding chapters we have used the symbol és? to represent 
(8x2 + dy’? + 822 — dt’), since we were employing imaginary time in 
order to introduce symmetry into the mathematical equations, so that 
ds was really an imaginary quantity. The necessity for this has, 
however, disappeared, as we shall see, and so this change in symbolism 
has been made. In fact, ds is the 6r of the simple Lorentz equations 
_ in the earlier chapters. 
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These, then, are the general equations of the “ natural” 
paths of a particle in the frame S. It is obvious to anyone 
familiar with general analysis that they satisfy the test of 
general Relativity, but the point will be precisely demonstrated 
later. The Principle of Equivalence is invoked if we postulate 
that these also are the forms of the equations of a free particle’s 
path in any natural frame of reference. 

Another illustration concerns the path of a ray of light in 
S. In S’ it is, of course, straight with a uniform velocity (the 
unit) ; its differential equation is 


dt’? — 8x’? — dy"? — 02”2 
or és 


O, 
O. 


So in S it is in general curved, and its differential equation is 
211007 +... + 24.00? + 22,,5%5y + . . . + 225,5x6t = O. 
For example, taking the form (4) above, we have 
h?x?dt? — dx? — dy? — 6227 = 0, 


and so the velocity of light at a “ level’ x in the frame S would 
be kx in any direction. This variation in velocity involves 
curvature of path, as is clear from other considerations.* 


* This particular illustration, due to Lorentz, has some points of 
special interest. Thus the equation of a “ falling’ body in S (i.e., one 
fixed in S is ¥ = a sech kt (where a is a constant, viz., its level at ¢ = 0) 
so that 

dx/dt = — kasech kt tanh Rt 
= — kx tanh At. 


This approaches kx, the velocity of light at x as ¢ approaches infinity, 
and never exceeds the velocity of light at the particular place through 
which it is passing at an instant. 

Similarly its acceleration d?x/di? can be worked out to be 
k*x(1 — 2%*/a?), which suffers a change of direction when the body 
reaches the level 7 = a/‘/2, causing the body asymptotically to 
peers the level = 0 with a velocity asymptotically diminishing 
© zero. 

A further interesting feature can be elicited if the equations (3) are 
written 

dx’ = a(d¥ — ust) 

SV E—nOVas 0s a= Oe 


c’8t’ = a(cdt — ufc . dx), 
where 
a = cosh kt 
au = — sinh At 
(ea li 


I, 
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A third illustration concerns the possibility of a change in 
the periodic time of ‘some natural occurrence, such as the 
vibration of an atomic radiating mechanism. 

Consider two identical atoms situate at different places 
(%1, Vi» 21), (2, Yo, 22), in the frame S. The proper time of a 
vibration at the first place would be (g,,),#5¢, (since 6x, = 0 = 
dy, = 82,), at the second (g44)otSt>. 

Now if we make the physical assumption that the identical 
properties of the atoms involve an equality of proper time, it 
is clear that d¢, is not equal to 8, if (g44), is not equal to (g44) 9, 
which is generally a fact. 

The principle of Equivalence will, then, lead us to conclude 
that a field of natural gravitation will produce a curvature in 
the path of a beam of light, and also have an effect on the 
positions of spectral lines obtained from a source of light situated 
in it. 

Passing to more general considerations, we are now in a 
position to grasp the nature of the problem facing us. We have 
to develop a method which enables us to test with the minimum 
of effort whether a given differential equation preserves the 
same form or not after a general transformation of variables 
subject to the invariance of a certain quadratic differential 
form, and we have to determine in the light of experiment and 
observation what are the conditions which must be satisfied in 
any natural frame of reference by the ten coefficients of that 


and, therefore, 
a = (1 — u*/c*)-3. 
Reciprocally we have 
6% = aldx’ — w’dt’) 
by = by’; 82 = 62’ 
cét = a(c’dt’ — u’/c’ . dx’) 
u/c’ = ufc. 


This is, in fact, a simple Lorentz transformation generalised to suit 
the assumption that the S and S’ observers correlate their measures of 
time by using ¢ and c’ as their velocities of light. Of course, the trans- 
formation could only be employed by two groups of S and S’ observers 
who happen to be adjacent, and only for occurrences in their immediate 
neighbourhood ; that is a condition imposed by the fact that the 
equations are in terms of differentials and are only true in so far as we 
can neglect the squares of the differentials. They are, however, sug- 
gestive of the momentarily existing relations between the measurements 
of length and time made by two groups of observers who pass one 
another in a given locality. A body fixed in one frame will be shorter 
in the ratio a : 1 to observers in the other. An occurrence at a locality 
fixed in one frame will take a longer time in the ratio 1 : a for observers 
in the other. 
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differential form ; in short, we have to develop a more general 
Tensor Analysis than that expounded in Chapter VI., and also 
discover a law of gravitation. 

It will prove convenient once more to adopt the symbols 
X41, Xq, X3, X4, instead of x, y, 2, ¢, bearing in mind, however, that 
x4 represents ‘‘ real’’ time in future and not “ imaginary.” 


VECTORS. 


We have aset of co-ordinates (%,, %», ¥3, ¥,), and the invariant 
separation between two neighbouring events is given by 


O52 == 910%," + 221 090% 10%, +. . . + £as0%,? Se EE) 


Let us transform to other axes of space and time involving 
new co-ordinates, which we will denote by (%,’, %’, %3', %4’).* 


There is a linear relation between the differentials, viz., 


ON == 10 40%) ee ees 
; ; : (12) 
O46 SO g0X1 es Se eae 
where @,,, . . ., @gg are in general functions of %,, 5, %5, %,. 
(V.B.—It is not in general true that ay = ayy.) 
We can write these in contracted form as before, thus : 
Oh) = 4y40%G : : =) (EZA) 
Now in these new co-ordinates 5s? will be equal to 
811 8%"? + 264 9'8%y'O%9' + ©. © + Baq'd%4?, 
where £11’, . . ., @aq’ are coefficients which are functions of 


%', X%_', X3', X%4'. But by the invariancy of 5s? this must be 
equal to the right-hand side of (11). Hence it is not difficult 
to work out that 


B11 = 4117811 + Ger'ao + Ag128e3 + Gar? a4 
+ 24 14arG19 + 211451813 + 20 44ai 814 
+ 2491431803 + 2@o14 a8 
+ 2431441854 > 
and nine other similar equations. 


* Earlier in this chapter we were using accented co-ordinates to 
refer to another point or event in the same system of co-ordinates. The 
reader should be on his guard to avoid confusion between that and the 
present use of accented co-ordinates as referring to the co-ordinates of 
the same event in another system of co-ordinates. 
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The one just written can be succinctly expressed thus : 


£11 = 401481808 » 


employing dummy suffixes, remembering that gy, = gu, (but 
not necessarily that dag = apq). 
Indeed, any one of the ten equations can be written 


Eu = @yABuab’ - : ; . (13) 


Nominally, there are sixteen of these equations and sixteen 
terms on the right-hand side of each. Really there are, owing 
to the equality of gag and gga, only ten equations, and the terms 
on the right-hand side of each can be reduced to ten. é 

Consider now a scalar function of the unaccented co-ordi- 
nates of an event, $(*,, %2, %3, %,). Expressed in terms of the 
accented co-ordinates of the same event, this is ob(%1’, %9', %3',%4’), 
and 

P(% 1, Xa, % 3, X4) = YP(%y', H's Xs’, %4'). 


For a neighbouring event these functions are ¢ + 6¢ and 
x% + 5% where , 
bf = 04/d%q . b%q 
and Sb = d/d%," . 6X4. 


These are also equal since 
b+ 86 =p + dy. 
In consequence of equations (12), 
06/d%q - O%q = dh/d%q" . (Aa18%1 + Aa2d%_ + Baz5%3 + Ga45%4) 


(there are really four terms on the left-hand and sixteen terms 
on the right-hand side). Hence, on writing out in full and 
equating coefficients of 5%,, of 5%, of dx, of 6x4, we get four 
equations, which, written out in full, are : 


9f/D% =A dap] Dy’ +. Ay 1 dyp/d% 0’ + 4g 1dYp/ 0% 3 +44, dy] 044" 
VP 2% g= Ay 2dep/D%y/ +A y2dep/D% 9! + Agnd4p/D4%Q' 4nd] dX" (14) 
dP/d% y= Ay 303h/ 0% y' +A ggd%p/ 0% 9’ + Agqdyh/9%q' + A aadtlde | 4 
VP/D% g=Az gdYp/D%y! + Ay gdY/ 0%! + Ay gdH/D%s' +-Aqgdp/ dq" 

Our immediate purpose is, however, to express the functions 
d%/dx,’ in terms of the functions d¢/d%), 1.e., to solve equations 
(14), regarded as linear simultaneous equations in 0%/d%)’. To 


this end and to avoid continual digression later, we shall now 
- collect those properties of the determinant 
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ay Ay @43 aya 
Ao Ao Ao3 Ao4 
a3) a3 A338 A34 
Agy Ago 443 Lag 


which will be required. 

This determinant we denote by |a|. To any constituent 
corresponds a co-factor, and we employ the symbol 4), to denote 
the result of dividing the co-factor of a), by the determinant 
|a|. There are sixteen of these symbols, and we can also 
construct a determinant with them; we denote this determin- 
ant by | 01. 

The following theorems can then be proved to be true by 
the theory of determinants. They are generalisations of the 
theorems in Chapter V., where certain restrictions were put 
upon the a coefficients but are now removed. 


THEOREM I. 
Galan et if mr —= pL 
= Ofb Ar 
and 
AyaP ue = I if A= us 
=O Ae 


(This is the analogue of equations (2), (3), (4), and (5) in 
Chapter V.) 

THEOREM II. 

A ee eas ae we 
(This is the analogue of Theorem I. of Chapter V.) 


THEOREM III. 


The co-factor of any constituent of the 6-determinant is 
equal to the corresponding constituent of the a-determinant 
divided by |a|, and the co-factor of any constituent of the 
a-determinant is equal to the corresponding constituent of the 
b-determinant divided by |b|. E.g., 


Do» Dog Dog 
Dso bss bs, | = ay,/\ a}. 
Das Das 44 


(Analogous to Theorem II. of Chapter V.) 
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THEOREM IV. 


The co-factor of any second minor of the b-determinant is 
equal to the corresponding second minor of the a-determinant 
divided by | @| (and similarly if a and d are interchanged). E.g., 


{ 
bs3 bsq 
43 Das 


(Analogous to Theorem III. of Chapter V.) 
Reverting now to equations (14) above, their solution gives 


DY/Dxy' = by dP/d%, + Oy2dh/d%_ + b15dh/dxy + by4d$/d%4 (15) 
and three similar equations, or 


dY/dxy’ = by ,d6/d%q 5 5 é (I5A) 
Looking at the transformation equations (12) and (15), we 
see that we have now to deal with two types of vectors or tensors 
of the first order. First, there are vectors which transform like 
d¢/d*y, 1.e., like the gradient of a scalar function ; this type is 
called “ covariant.’’ We denote such vectors by capital letters 
with the indicating digit written as a suffix proper ; so that to 
say that A,, 4,, As, A, are the components of a covariant 
vector in the unaccented co-ordinates is to say that the com- 
ponents of the same vector in the accented co-ordinates are 
related to the former as follows :— 


[@iy 9} 
— | i a is 
| Moi Ae [le 


Ay’ = by;4, + 04240 + 04343 + 014, 7 9(E6} 
and three similar equations ; 
or Ay = DA, . (164A) 


Secondly, there is the type of vector which transforms like 
the element of a “‘ world line,’’ as in equations (12). This type 
is called ‘‘ contravariant,’’ and we denote such vectors by 
capital letters with the indicating digit written as an affix on 
the right-hand side; so that if At, A*, A%, A‘ are the com- 
ponents of a contravariant tensor, then 


A’l = a1,A* + Cua + Gisa" -+ ar4 A’ . ° (17) 
and three similar equations ; or 
eA ey py We : ; SETA) 


(Of course, when the restrictions of Chapter V. are imposed 
on the coefficients, it is easily seen that a, = 0,,, and both 
types of vectors merge into one type.) 

It is easily proved from (16) and (17) that 

12 
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A) => psy vals ; > 6 ° (18) 
and AN == 10 NALee ; : peas) 


It must be borne in mind that the a and 6 coefficients are 
not constants, but are in general functions of the co-ordinates.* 

It is easily proved by equations (16) and (17), using Theorem 
I., that if A, be a covariant vector and B’ a contravariant 
vector, 


A, B't= AB f: 


in other words, A,* is invariant. 

(It is necessary to bear in mind that this is not true for 
products of the components of two covariant or two contra- 
variant vectors, i.e., A,B, is not invariant, nor A“, because 
it is mot in general true that a,)4,, = 1 if A=, and equal to 
zero if X + yw, and similarly for the 6 coefficients.) 

The converse of this proposition on invariancy is also true. 
Let A,, Az, Az, A, be four quantities such that after transforma- 
tion they become 4,’, A,’, A,’,A,’. Also let B* be any arbitrary 
contravariant vector. If after transformation it is true that 


A,'B'* = A,B, 


then the A) functions constitute a covariant vector. 
For, by hypothesis, 


A,'B'* = AgBS8 
and 5’ = bg,b", using equations (19). 

Hence 4,'B,' = Agb,gB" (there are 16 terms on the right- 
hand side), 
or {A,’ — bygAg)B’* = o. 


_ Hence as 5% is an arbitrary vector, the individual multi- 
| pliers of B”, 5’, B’, B’4 in the left-hand expression must be 
Zero. So 


A a ae bypA B; 


and from equation (16) we see that A) is a covariant vector. 
A similar converse theorem is true if we interchange the 
words “ contravariant ”’ and “ covariant ’’ above. 


© 


* For the origin of the terms “ covariant”’ and ‘‘ contravariant,”’ 
see the note at the end of the chapter. 
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TENSORS OF THE SECOND ORDER. 


If A, and B) are two covariant vectors, and if we write down 
the sixteen quantities A)B,, it is easy to see that on transforma- 
tion we have 


A)'B,! = byabupA Ba, 


there being sixteen such equations and sixteen terms on each 
right-hand side. 

Any set of sixteen quantities which transform in this fashion 
constitute a tensor of the second order, and we denote such 
quantities by using double suffixes, as Ay,, By,, etc., so that, 
after transformation, 


Ayu! = Dyadupdap,  - + «+ (20) 
and as a consequence of this 
Ady = Gay4pu4 ap’ 2. 3 Se er) 


(See equations (18) and (19).) 

Similarly a contravariant tensor of the second order is 
defined by sixteen components A, which satisfy the trans- 
formation relation 


Ai ty a pA, : ; yoy (22) 


involving also 
AKAD Uae ; ; « {Z3) 


The tensors A,B, and AB are referred to as the “outer” 
products of the two vectors A, and B,, or A* and Be. Such 
products form a special class of tensors of the second order, the 
general tensor not being expressible as such a product. 

There is a third type of second order tensor called “ mixed.” 
Its transformation equation is similar to that of the “ mixed ”’ 
product A,B. A group of sixteen quantities forming a mixed 
tensor is denoted by such symbols as A,#, By#, etc.; being 
covariant with respect to the suffix A, and contravariant to the 
affix yw, its transformation equation is 


Ayt= 0. d.gA iP, ; ; : «9 (24) 
involving also 
Avi 00g Ae” : a 1e5) 
By Theorem I. it is easy to see that the sum of the four 
components of A)“, which have suffix and affix equal, is 
imvariant. In fact, 
ne =a ee 


IZ 
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This is an example of a process called “ contraction” of 
mixed tensors which will presently be explained more fully in 
connection with tensors of still higher orders. Indeed, the 
invariance of A,B+, already established, is an example of the 
contraction of a mixed tensor product. 

A tensor of the second order whose components satisfy the 
conditions Ay, = A,,, or AM“ = APA, is called a “ symmetric ”’ 
tensor, covariant or contravariant, as the case may be. If the 
conditions A), = — A,), or A = — APA, are satisfied (involv- 
ingA., = A= A; = Ai 0, OAM = 6 — cic) the tensor 
is called ‘‘ antisymmetric,’ and precisely as in Chapter V., can 
be considered as a “ six-vector’”’ (covariant or contravariant). 

A reference to equations (13) and (21) shows that the 
g-coefficients constitute a symmetric covariant tensor of the 
second order. This tensor, on account of the great importance 
of the expression for the invariant 6s, is generally called the 
“fundamental ”’ covariant tensor. In this case there is a slight 
departure from the usual notation, the suffix being attached to 
a small letter instead of to a capital. 

Let us consider two second order tensors A), and Bb, 
covariant and contravariant respectively ; we can show that 
the sum of the sixteen terms A,gb*8 is invariant. For by 
equations (20) and (22), 


A ag D8 = AyalsgbYd.,qbsgA 5: 


There are 256 terms on the right-hand side. Fixing for the 
moment on definite values of y and 5 (i.e., considering sixteen 
of these terms alone), it is not difficult to see that in view of 
Theorem I. 


> (4yaya4sgb3g) (16 terms) 
ap 


= (Yayabya)(LAsebsg) (each bracket contains 4 terms) 
a B 


=Ix I 
= I. 


Hence the coefficient of each of the terms A.3ByY on the 
right-hand side is unity. Hence A,gB*8 is invariant. (This 
is a further example of contraction of a mixed product.) 

It is somewhat tedious but not difficult to prove the converse 
of this result, just as we did for vectors. If A), is a group of 
sixteen quantities which transform in such a way that A,gBo® = 
Ap B'8, where Bb is any arbitrary contravariant tensor of the 
second order, then Ay, is a covariant tensor of the second 
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order. A similar theorem can be proved when “ covariant ” 
and “ contravariant are interchanged. 

This result enables us to discover a fundamental contravariant 
tensor, for if we take the co-factors of the ga, in the determinant 
|g| and divide each of them by the value of |g], the sixteen 
quotients form a (symmetric) contravariant tensor. Denoting 
them (in anticipation) by the symbols g’“, we know by the 
theory of determinants that 


ST rea is Saal ee Solent 4 lee 
and three similar equations ; or 


DLaagr® — i, 


which proves the contravariancy of g in the light of what we 
have just demonstrated. 
Indeed, 
Sul = TH A= p 
Ses, eae 


It can also be shown that the product of the determinant 
of the ga, functions and that of the g functions is unity, or 


el tie | 
A relation exists between |g| and |g’| whichis of consider- 
able importance. It will be as well to deal with it now. If we 
write out the determinant |g’| in full and the determinant |a@| 
of the coefficients a, in equations (12), and employ the rule 
for multiplying determinants, we obtain as their product the 
determinant 


, , , , 
G18 a1 Fa18a2 Faas Fab a4 
/ 
Ba2§ a1 Fa2ba2 Fak a3 Fas a4 
Aasar Ba38a2 Fas8a3 Vas8aa 
, , 
Aaa§ar GaaSa2 Gas8as QasSaa 


where each constituent is the sum of four terms. 

If we again multiply this determinant by |a|, we obtain a 
determinant of which each constituent is the sum of sixteen 
terms such as @,)4g,g.g ; in fact, it is the determinant |g| as 
a reference to equations (13) will demonstrate. 

Hence 

ete bap 


Presently we will see that the nature of the space-time 
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continuum is such that the g-determinants are everywhere 
essentially negative. Assuming this result for the time being, 
and dropping the bars at the side of the symbols for conveni- 
ence, we see that the determinant of the a-coefficients used in 
the differential transformation from unaccented to accented 
co-ordinates is equal to ./(— g)//(— g’). 

This result is of some importance in connection with the 
concept of four-dimensional volume. Taking a body at rest 
relative to an observer, and multiplying its ordinary three- 
dimensional volume by the interval of time between two events 
connected with that body, we obtain a “ four-dimensional ”’ 
volume. Referred to a frame in which it is moving, the body 
has a different volume and shape, but if the motion be uniform 
the shape and size are definite, and if we multiply this volume 
by the interval between the two events as measured in the 
new frame, we arrive at the same result as before. That we 
already know from the earlier chapters on the Lorentz trans- 
formation. We are, in fact, dealing with a cylindrical four- 
dimensional volume whose generating lines are the parallel 
world lines of the surface particles of the body. But if the 
motion in the frame be not uniform, the shape and size of the 
body are altering continuously, the world lines of the surface 
particles are not parallel, the three-dimensional section of the 
world tube of the body is varying, and in order to obtain the 
four-dimensional volume embracing the body at all instants 
between the events we must multiply its three-dimensional 
volume at an instant by a differential element of time in the 
frame and integrate. The final result has not the same value 
as before, but the relation connecting such four-dimensional 
volumes of a body between two events can be arrived at by 
applying a theorem due to Jacobi, which shows that in virtue 
of the transformation equations (12), 


fax ax dxsdx sy =f iff \a\dx dx dxdz,: 


or, in view of the fact that at each definite point instant 


[a] = J/(— g)//(— 8’), 


we can write it 
MIS 8) 4% ax dx ghx = f[Ir/ (— g)dxy'dxq'dxq'dx'. 


This invariant integral is equal to the integral ////dxdydzdt 
in a Galilean frame, for in it g = — 1, since its constituents 
have the values g1, = 822 = £33 = — I, 244 = I, and the rest 
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zero. The integral ////dxdydzdt is the “natural” volume. 
Einstein points out that if g should vanish anywhere, an 
elementary small natural volume would correspond to a 
finite co-ordinate volume dx, dx, dx; Sx, This he assumes 
not to be the case anywhere in space-time accessible to us, 
making, in fact, a definite hypothesis concerning the physical 
nature of the continuum. If g does not vanish anywhere in 
any frame it preserves the same sign ; and as it is negative in 
the restricted Relativity theory, its constant sign is negative, 
so that ,/(— g) is a real quantity. 

We can employ Theorem IV. to demonstrate the existence 
of a six-vector “‘ reciprocal’’ to a given six-vector. Thus, if 
Ay, bea covariant antisymmetric tensor or six-vector, its trans- 
formation equations are 


Draw bag 
Dua bup 


A awe = 


A ag, 


where there are six terms on each right-hand side and the suffix 
aB goes through the sequence, 23, 3I, 12, 14, 24, 34 in the 
summations. It is easily seen that the antisymmetry is pre- 
served in the accented co-ordinates. If we use Theorem IV., 
we see that the thirty-six determinant coefficients can be re- 
placed by coefficients which are second minors of |a@], each 
divided by |a|. Going through a procedure similar to that in 
Chapter VI., we find that A ,,/a, A24/a, A34/a@, A 23/4, Ag,/a, Ay,/a@ 
transtorm intG Ay,, Aa, , Age, Aos, 451, A1y by exactly the 
same equations as the components of a contravariant six-vector 
in this order, B??, B31, B12, Bi4, B%4, B%4, transform into B’®, 
B’31, B22, B14, B44, B4 But |a| = /(— g)/,/(— g’). Hence 
we prove that if A,, be a covariant six-vector, there exists a 
contravariant six-vector B, such that A,, = ./(— g)B", etc., 
eed (— 2). Bis “ reciprocal’ to-Aj,. 

Similarly, if A* be a contravariant six-vector, there exists 
a reciprocal covariant six-vector By,, such that 


ea) (8), 2 5 AO = Baal (= 2). 


TENsoRS OF HIGHER ORDER: OUTER AND INNER 
MULTIPLICATION : CONTRACTION. 


We can proceed by very obvious generalisation to tensors 
of higher order than the second. Thus a covariant tensor of 
the third order consists of sixty-four components Aj,,, such 
that the components in accented co-ordinates are related to 
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those in unaccented co-ordinates by the sixty-four equations 
as poe Dr adupoyyA apy» 


each of which has sixty-four terms on the right-hand side, as 
a, 8, y each go through the sequence I, 2, 3, 4 independently. 

Particular cases of such tensors are the “ outer’”’ product 
of a covariant tensor of the first order (or vector) and one of 
the second, A,,B,, or the outer product of three covariant 
vectors A,B,Cy. 

Similarly a contravariant tensor of the third order has its 
components in two systems connected by 


AAW ANd ply AN, 


and special cases are A*“BY and A*BrC”. 
As before, we can prove the invariance of 


A aByB ee 


which is the sum of sixty-four terms, and also derive a corre- 
sponding converse theorem. Here we have a further example 
of “ contraction.” 

A mixed tensor of the third order may either be of the type 
Any or ApH, i.e., covariant with respect to two indexes, and 
contravariant to one, or vice versa. In the former case, 


A’ nu! = DyabuparyA ap? ; 
in the latter, 
A')MY = Dra upayyA FY. 


Special examples are such mixed products as A,,B” or 
A,B,C, and A,B, or A,Bec’. 

By a more general application of the process of contraction 
than we have hitherto used, we can derive vectors or tensors 
of the first order from mixed tensors of the third order. Suppose 
for example we consider Ay g*; this means the sum of four of 
the components of the mixed tensor A,,”. There are four such 
sums, viz., 


3 
and As peg 


That is, we have selected sixteen of the components of A nyne 
which have the affix equal to the second suffix, and summed 
them in tetrads in a definite way. It is not difficult by 
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means of Theorem I. to show that these four quantities con- 
stitute a covariant vector ; or 


A’pa® = bagA pa". 


Similarly we can “ contract” the mixed tensor A“, into a 
contravariant vector A, for which 


A'da, = aygABa,, 


When we apply this process to the mixed tensors obtained 
by outer multiplication of tensors, the results are termed 
“inner”? products. Thus A,,B¢ in the “ inner ” product of 
Ay, and B’, and AB, of A and B,.* 

The invariant A,B* is, of course, the inner product of the 
vectors A, and Be. 

A tensor of the fourth order consists of 256 components. 
Its transformation equation if covariant is 


‘ 3 Ps Pr = Dada p0 uy vsA aByd> 
if contravariant 
ARMY = Ayal BA ypAPPY®, 


if mixed, 
A ee = Diab rBOpy4vsA See. 
/ 
or A rh! = Pra rpapy4ysA apr’; 
or Aid” = by. Arpad pyar oP. 


There are 256 terms on the right-hand side. 

“ Contraction” of Ayay” gives Axaa%, i.e., involves a selec- 
tion of sixty-four components of A,,,”, arranges them in groups 
of four and adding these tetrads, gives sixteen quantities which 
have the properties of a covariant tensor of the second order, 
ie, 

Ig BP Ps ae bxp0ryA Bya”- 


Similarly A*A*, is a contravariant tensor of the second order. 
Taking A,,#”, we first contract it into A,q"*, which is a mixed 
tensor of the second order, and if we apply the process of con- 
traction once more we obtain the invariant Ag,*, of which 
we met an illustration above in the invariant quantity A.gb*. 
We have examples of contracted tensors in the “ inner ” 
products A,,_B* (covariant), AB, (contravariant), etc. 
The process of contraction can only be applied to mixed 
tensors; it reduces the order of the tensor by two, and 


* These are the analogues of the vector product [A . §8] of a vector 
and a tensor of the second order in Chapter Wale 
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ultimately, after as many contractions as are possible, leaves a 
covariant or contravariant tensor or an invariant according as 
the number of suffixes is greater than, less than, or equal to 
the number of affixes. 

A symmetric tensor of the third order only involves twenty 
numerically different components, for any six which involve the 
same suffixes or affixes (numerically different) in different orders 
are equal, and so twenty-four of the components reduce to 
four. Then Aye = Aery = Aagy and Agos = Ayes = Acie, 50 
that these six reduce to two and five other similar sets of six 
reduce to ten. There remain the four, A1,;, Asoo, Aggs, Aaaa- 
Thus there are twenty in all. 

If the tensor be antisymmetric there are only four numeri- 
cally different components. All those involving at least two 
equal suffixes or affixes are zero; as regards the remaining 
twenty-four with different suffixes, we have, e.g., 


A 48 or A x39 = Asi2 = a Asn = A ea1 aa A o13 


Hence the transformation equations in this case reduce to four 
in number, viz., 


Dra byp Dry 
Angas == "One bup buy | Aagy (four terms). 
va byp vy 


where on the right-hand side afy goes through the sequence 
234, 314, 124, 321 in the summation, and Ap is replaced 
by the same groups in succession. Remembering that 
|a| = ./(— g)/(— g’), we can, by Theorem III., prove that 
if Aa, is covariant and antisymmetric, there exists a contra- 
variant vector BA such that ,/(— g)B1 = A o34, ./(— g)B* =Agqy, 
a/(— g) B® = Ayes, a/(— 8) B4 =Agq,. Similarly if A” is 
contravariant and antisymmetric, there exists a covariant 
vector B, such that B,/,/(— g) = A4, etc. 

In a four-dimensional continuum there are no symmetric 
tensors higher than the fourth order, and an antisymmetric 
tensor of the fourth order has only one numerical value for all 
components such as A494, A4342, etc., which are not zero. 


ASSOCIATED TENSORS. 


Considerable use is made of the process of contraction in 
connection with the fundamental tensors g,, and g“. Indeed, 
it reveals the tensor nature of these quantities very directly. 
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Thus gagd%o5%p (i.e., 21:5%,? + etc.) is invariant (8s?) ; so that, 
writing it in the form 


(214 8% _)5% 1 + (820 bXq)d%Xy (Bsa bX q)OX3 + (840 8% q)5%4, 


and remembering that 8x,, 8%,, 3x3, 6x, form a contravariant 
vector, we see that 


is NaO%a 


is a covariant vector. Therefore, by the converse of one of the 
propositions above, this must be the mixed product of the 
contravariant vector 6x, and a covariant tensor of the second 
order gu. 
Again, write 
by Y= fe d%a, 


so that dy,, dy, dy3, dy, constitute as stated a covariant vector. 
By solving we get 
0%, = POH). 


The right-hand side is, in consequence, a contravariant 
vector, and must therefore be the inner product of the co- 
variant vector dy,, and a contravariant tensor of the second 
order g*¥. 

From g,, and g** we can form a mixed tensor of great 
importance by multiplication and one contraction. The outer 
product is ga, g“*, a mixed tensor of the fourth order. Con- 
tracting by putting vy = x = a, we get g,. g’. Wewrite this as 
g\“, and by the theory of determinants we know that it is unity 
ifA = v, and zeroifA + v. Hence 7m any co-ordinates the mixed 
fundamental tensor g“ has the values given by the scheme 


oI: O O O 
O IT O O 
O O if 10) 
O Oo O a6 


As a matter of fact, one can satisfy oneself very easily that 
this scheme transforms into itself by application of the mixed 
transformation equation 


& x ra DratupfaP. 


A further reduction gives gagg*8, Which is an invariant and 
has, as a matter of fact, the value 4 in all co-ordinates. 

We can “ associate ”’ tensors of different order and character 
_ with a given tensor by use of the fundamental tensors. 
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For example, we can derive a contravariant vector from a 


covariant vector thus: 
Ar = BA ‘ 


or a covariant from a contravariant thus : 
A ,=— 8g uA a: 


Likewise, for tensors of the second order, we can derive 
contravariant from covariant, or vice versa, or mixed tensors 


from. either. [¢., 
AM == phe BBA ag 


The following theorem concerning the reciprocals of associ- 
ated antisymmetric tensors of the second order will be of service 
later. 

Let Ay, and A be associated antisymmetric tensors, and 
let B be the tensor reciprocal to Ay, and Ba, reciprocal to A, 
then the tensor associated with Bb is the negative of By,. 

To prove this we must bear in mind that the cofactor of 
any second minor of the g\ determinant is equal to the cor- 
responding second minor of the g,, determinant divided by g. 
This is, in fact, Theorem IV. above applied to g. 

Now, taking, say, the 12-constituent of A, we have 


Al# — glap?84 ,. (12 terms) 


lao1p | 
grag? | A ag (6 terms) 
where, in the second line, a8 has to go through the sequence 
23;-31, 12, 14, 24, 34. 

Hence 


Bul/ (— 8) = 


S2y89s | BP//(— g) (6 terms) 


§ay85 


(where yd corresponds to a8, as 14 does to 23, or 24 to 31, or 
34 to 12); 

; = — (83784357°)/,/(— g) (12 terms), 

icem B= = Say sb. 


Other components can be treated likewise and the theorem 
proved. 

We have so far dealt with these tensors whose transforma- 

tion equations involve coefficients which are not constants but 
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functions of the co-ordinates, so as to parallel and extend 
certain results in Chapter VI., but we have not yet touched 
on differentiation. We saw in that chapter that simple 
differentiation yields further tensors directly. For the more 
general tensors now introduced this is not true, except in one 
case, viz. the gradient of a scalar function. But nevertheless 
a process which depends upon differentiation can be applied to 
these tensors in order to yield further tensors. This process, 
called “‘ covariant derivation,” is of the utmost importance in 
the application of tensor theory to General Relativity, where we 
have to express laws of nature in the form of covariant differen- 
tial equations. 


NOTE ON THE TERMS “ COVARIANT ’’ AND ‘‘ CONTRAVARIANT.”’ 


Let us suppose that at a point in space-time we have chosen four 
fundamental unit vectors which we shall denote by U;, Uz, Us, Uy. 
The vector displacement, whose components are §%, 8%, d%3, #4, is 
equal to the vectorial sum 


64%,U, + 6%,U, + 6%3U; + 6%,0, . ‘ o (be) 


If we transform to another set of fundamental unit vectors 
U,’, U,.’, U;’, Uy’, and the components of the same vector displacement 
are 6%,’, 6%,’, 6%,’, d%,’, then (I) is equal to 


§%,/U,’ + etc. : : ‘ ar2) 


If now the equations (12) of the chapter hold between the com- 
ponents $x, and $%,’, it is easy to prove that the relations between the 
two sets of fundamental vectors are 


Uy = Gar0a 
or U)’= age : : - as) 


It is clear, therefore, that the components of the vectors, which are 
affected by a suffix in the text, are variables which are transformed 
cogrediently to the fundamental unit-vectors, and so vectors possessing 
such components are called covariant vectors. On the other hand, the 
components of the vectors which are affected by the affix are trans- 
formed contragrediently to the fundamental unit-vectors (this is so 
because of the identity of the vector A¢U, with A’-U,’), and vectors 
with such components are termed contyavariant vectors. 


CHAPTER IX. 
GEODESICS IN SPACE-TIME. 


Let P and Q be two point-instants in space-time, and consider 
any world-line joining them. Near to this world-line let 
another world-line be drawn, also joining P and Q. To any 
point-instant A on the first world-line let there correspond a 
point-instant A’ on the second world-line, such that the separa- 
tion between P and A along the first line is equal to the 
separation between P and A’ along the second, i.e., 


fas = fas. 
PA PA’ 


In general, the point on the second line corresponding in this 
way to Q, regarded as on the first, is not Q itself ; let it be Q’. 

Writing 22 for the quadratic function in dx,/ds, .. ., 
dx,/ds, with which we are now familiar, viz., gug%a%_ (the dot 
signifying the total differential coefficient dx,/ds), and which 
is everywhere equal to unity, we have 


We Ove Oke OL Ot es =O) 


where 8x, refers to the first order change in the co-ordinates 
on moving from a point A on one world-line to the corresponding 
point A’ on the second; and 8x, refers to the changes in 
dx,/ds (which are analogous to the direction cosines at a point 
in a curve in two or three dimensions). We can analyse 8%, 
more closely by considering a point B on the first line adjacent 
to A, and B’, its corresponding point on the second line, so that 
the co-ordinates of B are x, + dx,/ds . ds, etc., and of B’ are 
Xx, + 8%, + d(x, + 8x))/ds . ds, etc., the differential ds being 
the separation between A and B or A’ and B’, for by the method 
of correspondence used these separations are equal. But the 
co-ordinates of B’ can also be derived by varying those of B; 
so they are 


(%, + dx /ds . ds) + 8(%, + dx,/ds . ds). 


Igo 
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Hence, by comparison of the two results, we see that 


d(5x)/ds a 3( x /ds) 
= 8%). 
Hence 


OL /d%q- O%q + IL/d%q . d(8x,) /ds = o. 
The second term on the left-hand side is equal to 


A(dL/d%Xq . 8%,)/ds — 5x d(dL/d%,)/ds. 
Therefore 
(OL/d%_ — d(dL/d%,)/ds)8xq + d(dL/d%, . 8%,)/ds = 0. 


If this expression is multiplied by ds and integrated along 
the world-line PABQ from P to Q, we obtain 


AdL/d%q — A(dL/d%q)/ds)8xgds = — dIL[d%q . 8%Xq. 
PQ 


Where, on the right-hand side, )L/d%q is estimated at QO and 8x, 
refers to the differences between the co-ordinates of Q and those 
of QO’. 

Now if the original world-line PAQ happens to be a “ geo- 
desic’”’ (i.e., analogous to a shortest or longest length across a 


surface in two dimensions between two points), so that /ds is 


b “cc s ” . A PQ 
maximum, or minimum, or “ stationary ”’ along this line, then 
fds along any arbitrary adjacent line differs from the integral 


1% 

iene the first by a quantity of the second order. Consequently, to 
this order Q coincides with Q’; or, on the right-hand side above, 
5%, are all of the second order, and so this side can be equated 
to zero. 

Thus the integral on the left-hand side is zero when integrated 
along a geodesic. But the 6x, in each element of the integrand 
is entirely arbitrary as the total separation along the geodesic 
is stationary for variation to any arbitrary neighbouring line. 
It follows that the factor multiplied by each of the 8x, is also 
zero. So finally we arrive at the result that 


dL /d%, — d(dL/d4,)/ds = 0, 


provided d/ds refers to differentiation along a geodesic line. In 
other words, these four equations are the differential equations 
of a geodesic line in the continuum. 
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Since 
IL /dx, = %aX pap] dX) (16 terms) 
dL |d4, = Baka (4 terms) 
and = d(dL/d%)/ds = gya%q + Xakpr¥ja/d0%e (20 terms), 
it is not difficult to collect terms and show that these four 
differential equations are 


Lyrae + [aB, Alxa%e =O (4 terms in %, 16 in #4), 


where 

[my A] = $(dprfd%y + Wal O%p — 2Buy/d%a). 
(uv, A] is called Christoffel’s three-index symbol of the first 
order. 

These four equations (obtained by putting A = I, 2, 3, 4 in 
succession) may be looked upon as four simultaneous equations, 
%1, Xe, X%3, X4. To solve them for %,, say, we multiply them by 
g11, g?1, 931, g41 respectively and add. 

Since g“lg,, = I and g%!g,. = 0, etc., we obtain 


% + (gM (a, 1] + g*[aB, 2] + g*[aB, 3] + gM laB, 4])4a%8 = 0 


or Xx, + {aB, 1}%,%— = 0, 


where 
{uv, rj = g[yr,a]. 


{uv, A} is called Christoffel’s three-index symbol of the second 
order. 

This expresses %, in terms of a quadratic function of the %, 
and we have the differential equations of a geodesic in their 
most compact form as 


%, + {aB, A}t.%p = 0, . : ~ 
where A is put equal to I, 2, 3, 4 in succession. 


COVARIANT DERIVATIVES OF TENSORS. 


It was stated in the last chapter that if A) is a tensor of 
the first order the sixteen partial differential coefficients 0A )/d%, 
do not constitute a tensor of the second order, as is the case 
when dealing with the general Lorentz transformation. As a 
matter of fact, it appears that the sixteen expressions 


0An/d%, — {Au, aA, 


are components of a covariant tensor of the second order. 
Each of these components consist of a four term expression 
linear in Aj, Ag, As, Aq, in addition to the differential coefficient. 
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Similarly if A is a contravariant tensor of the first order, it 
appears that the expressions 


VAN Ixy + fap, ALAa 


constitute a mixed tensor of the second order. These derivatives 
are called “ covariant ’’ derivatives because in each case the 
number of suffixes attached to symbol representing the derived 
tensor is greater by unity than the corresponding number for 
the original tensor. 

We can prove these statements by beginning with a special 
type of vector and afterwards generalising in an obvious way. 
Thus, taking the gradient of a scalar function ad CAR a eee PEE) oo 
we know that d¢4/d%) is a covariant vector, and 


d0/d%q . dx,/ds 


is invariant, where d@x,/ds are the ‘direction cosines” of a 
geodesic at the point (%,, %2, %3, ¥,) and constitute a contra- 
variant vector. In fact, geodesic lines remain geodesic in any 
space-time frame of axes. This property which a world-line 
possesses in virtue of the separation along it being stationary, 
is independent of axes, and so dx,/ds are bound to transform 
into dx,'/ds along a geodesic after any transformation. Differ- 
entiating again with respect to s totally, we see that 


d(d$/d%_ . d%,/ds)/ds 


is also invariant, since it is the limit of the invariant value of 
the quantity d4/d«, .dx,/ds, at a point-instant B minus the 
invariant value of the same quantity at a neighbouring point- 
instant A on a geodesic, divided by the invariant value of ds 
along AB. Hence 


36/d%q . d2x,/ds® + 97h/dx%,d%g . Ax,/ds . dxp/ds 


is invariant. as 
But since ¥, and %) refer to total differentiation along a 


geodesic line, therefore %, = — {af, A}x,%, and so 


(294 /2x,dx9 — {aB, y}d]2xy) ap 
is invariant. 

Now % is a contravariant vector, and so the outer product 
%,%, constitutes a contravariant tensor of the second order. 
Further, since there are an infinite number of geodesics through 
a given point-instant, %),%, may be regarded as any arbitrary 
tensor in the above. In consequence of this invariance and 
_one of the theorems in Chapter VIII., it follows that 


13 


194 RELATIVITY 


926 /dx%, IX, — {Aw, a}dh/d%q 
constitute a covariant tensor of the second order, which is the 
covariant derivative of the covariant vector 0¢/d%). 
Let A, be any covariant vector, and let ¢,, ¢, ds, a DE 
any four scalar functions of %1, %, %3, 4. After transformation 


let 
’ , , é 
A ,(%1, %2, %3, %q) = By(%y', Xo, Xe, Xe i ale eRe 


and (4s, Xe, a, Xs) = Pa, Ha, He picg Frere. 


It is very necessary to note that B, is not A,’, etc., for, of 
course, A,’ = 6,,4, and + B,; in fact, B,, B,, Bs, B, are not 
the components in accented co-ordinates of the vector A). 
Consider the vector 


A 1d44/d%, + Agdho/d%, + Agd¢3/0%) + A d$4/d%), 
which transforms into the vector 


1.e., Baddsg/d%y’ = b),A gdb8/d%q. 


Now each term of Agddg/dx, is a definite multiple of a 
gradient, and therefore has a covariant derivative; therefore 
the vector Agddg/d%, has a covariant derivative. By putting 
Oy == 01, Pe = Xe, Oy = on Oe = Sanwwe see that thesvcecrorm 
Agoddp/dx, becomes the vector A,. Hence A) has a derivative 
which is a covariant tensor of the second order, viz., 


dA y/d%u = {Au, atA g. 


We shall denote it by the symbol (A)),. Incidentally, we 
have also proved that any covariant vector can be expressed 
as the sum of four special vectors of the gradient type. 

To deal with covariant tensors of the second order we begin 
with an outer product of two covariant vectors A, and B). 
The outer product of the (Ay), by B), and of (By), by A, are 
individually covariant tensors of the third order. It is also 
clear that the summation of corresponding components of two 
tensors of the same kind and order will yield a tensor of that 
kind and order. Hence 


Ay(By)y + By(Au)y 


is a covariant tensor of the third order. But this is easily seen 
to be 


0(A,B,)/d%, — {Av, a}A aPy — {uv, a}A)B,. 
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Now in a manner similar to that adopted above we can show 
that any covariant tensor of the second order is the sum of 
sixteen special tensors which are each outer products of co- 
variant vectors. In fact, if A, is the tensor, it can be expressed 
as 


A 4106,/d%) . d64/d%u A 190h4/0% . 9p 9/dXp +. etc. 
where $1 = %1, b2 = Xo, $y = %3, fa = %y 


So the process of covariant derivation can be extended to 
any covariant tensor of the second order and yields 


dA ay/dXy a {Av, aA au = {uyv, atA xa, 


which we denote by (Aa,),. 
(The position of the dummy suffix must be carefully noted 
in each of the last two terms, for in general Ay, + Aya.) 
These results can easily be extended to higher orders ; for 
instance, the fourth order covariant derivative of Any, 1.e., 
(Anny), 1S 
VA aps/d%e — {Ak, a}Aauy — {ux, a}Arw — {vK, a}A rua 


If we wish to deal with contravariant or mixed tensors we 
can do so most readily by the method of “ association ” referred 
to at the end of the last chapter. An alternative way of expres- 
sing the relation between the two Christoffel symbols is required 
for our purpose. Thus since 


Au, v} = g@ (Au, a] 
it follows that = gxafAu, B} = geagh*[Ap, a] 


= [Ap, ] 
for £8 = Tit a = «K 
= Oi @ == 'k. 
So that Au, v] = SrafAp, a}. 


To proceed, let A* be a contravariant vector; then g),A* 
is the “ associated ”’ covariant vector. Hence 


(Ladd *) /d%u a {ip B}epad* 


is the eu component of a covariant tensor of the second order, i.e., 


GnadA4/IXp + (xa! IX. — [Kp, a]) Ae 
is covariant. 
It is easily seen that the factor within the brackets () is 
‘the three-index symbol [ap, «], so 


Ti 
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LuadA%/dxp + [ap, KlA% 
is a covariant tensor of the second order. If multiplied 
(externally) by g”, it becomes a mixed tensor of the fourth 
order. On contracting by putting v= « =, we obtain a 
mixed tensor of the second order. But g{apu, B] = {ap, A}, 
and g5v_, =1ifa=A,and =oifa+-A. Hence 


VAA/I%X~ + {ap, A}A* 
is a mixed tensor, and is the covariant derivative of A’, 


denoted by (A4),. 
Similarly the covariant derivative of A™, i.e. (A),, is 


dArA/dx, + fav, AJA + fav, w}Ar. 
The covariant derivative of the mixed tensor A}#, i.e. 
(Ay*),, is 
0A H/d%, — fAv, a}A gH + {av, w}A 2. 


DIFFERENTIALS OF THE FUNDAMENTAL TENSORS. 


Since g,eg’8 is constant (being unity if w = v, and zero if 
pe = v), it follows that on varying the values of the co-ordinates 
in the g,,-functions, 


gbeug = — &upog’?. 


Multiply each side by g,, and sum for the four values of v; 
we obtain 
Brak OS up = — rakupds*? 
fey Oru = — Srakupds?? ; 


or, in terms of partial differential coefficients of the &au-functions, 
we have 


Waul dX = — Laakypoe’s/dxy, . : 1) 


which expresses any partial differential coefficient of a com- 
ponent of the covariant fundamental tensor in terms of the 
ten partial differential coefficients of the components of the 
contravariant tensor with respect to the same co-ordinate. 

In a similar manner we can show that 


oghu/dty = — gregwBdgap/d%, . .  . (3) 
It is easy to show that 
dEu/d0X, = [Av, w] + [pr, A], . , S24) 


and so by (3) 
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dg [Ivy = — greguB([ay, B] + [By,a]) (32 terms) 
= — grav, w} — grB{By, A}; 
or, since the particular letter chosen for a dummy index does 
not matter, 


Way/ My = — (gA*{av, p} + gtafav, Ay) Ex <e7(5) 


If || is the determinant of a series of constituents k,,, 
Ria, .. ., Rg, and K,,, Ky,, . . ., Ky, represent the corre- 
sponding co-factors, it is known by the theory of determinants 
that if the constituents individually receive small variations, 
then the variation of |k| is equal to 


Ky,8k, + Ky.6kg +... + Kg dR gy. 

Hence, since gg“ is the co-factor of g), in the determinant g, 
it follows that 
; Og = geFogig 
See d log g = de/g = g**Seu8 . : (6) 
or, if we wish to deal with the logarithm of an essentially 
positive quantity, 

6 log (— &) = 8% o8ag, 


so that d log (— g)/d%, = gdg48/d%) : 79) 
Now b*eap = 4, 
hence Lapr878/d%, + SF wap/dx%, = O, 


and therefore 
2 log (— g)/d*%, = — gapdg*8/dx, - + (7A) 
Combining (7A) with (5) we obtain 
d log (— g)/d%, = Sanler{yA, B} + B7{yA, a}) 
= {PA, B} 


ada, a 


= 2{ha, a} : : : - (6) 


an extremely useful result for later work. 
Another way of writing (8) is 


dlog (— g)#/d4%, = (— g)~#2(— g)#/dx, = fra, a} . (8a) 


CuRL, DIVERGENCE, AND LORENTZIAN. 


We can now proceed to deal with the analogues of certain 
vector operations employed in Chapter VI. 

In the first place, it is easy to prove that if A), is the Ap- 
‘component of any covariant tensor of the second order, and 
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A, is its pA-component, then A,,—A,a constitutes the com- 
ponents of a six-vector when Aw goes through the sequence 
23, 31, 12, 14, 24, 34. —In particular, 4,5, — A,B, is a six 
vector, and is analogous to the six-vector product of two 
four-vectors in Chapter VI. 

Let (A,), be the covariant derivative of A), then 


(Ay)u — (Au)a = 04)/0%y — 4, /0*) 
for fAu, a} = {pA, a}. 


It follows that 3A)/d%, — dA,/dx, is the component of a 
six-vector which is the “ Curl” or “ Rotation” of A). 

It is perhaps as well to point out that the operation 
“Curl,” when applied to a contravariant vector, does not 
yield a six-vector, because {ap, A} + {aA, pw}. 

We can, however, obtain a scalar Divergence of a contra- 
variant vector, for (A*), is a mixed tensor, and so by contraction 
(A*), or 

dA4/dx_ + {Ba, a}AB 
is invariant. 

Hence by (8A), writing g for (— g)3, 


dAB/dxg + g—1A%dqg/dx 


is invariant ; or, since the letter used for a dummy index does 
not matter, 


g—d(gA%)/rx, . P : « (9) 


is invariant. This is the analogue of Div A of Chapter VI. 

There is no corresponding function for a covariant vector, 
unless we introduce it via the associated contravariant vector 
pha A - 

We now come to analogues of the four-vector Lorentzian of 
a six-vector. 

Let Ay, be an antisymmetric covariant tensor or covariant 
six-vector, then (A,,), is a covariant tensor of the third order 
(the components for which A = pw are zero). A study of the 
transformation equations shows that 


(Anu)y oe: (Aw)a Ss (Ava)u 


is an antisymmetric tensor of the third order. 


‘ Since {Au, a} = {uA, a} and Ay, = — A,, it easily follows 
that 


9A ny /d%y -f- VA py/ da + 0A ya/d%Xu 


is an antisymmetric tensor of the third order if A), is anti- 
symmetric. 
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From a result in Chapter VIII. (p. 186), we see that if oe 
is a covariant six-vector, there exists a contravariant vector Br, 
such that 

qB« = (Aay)y + (Aw), + (Aya) u 
— dA yy /d*, ++ dA yy /d%Q + 9A a/d%u . : (ro) 
where x, , #2, v are replaced in turn by 1, 2, 3, 4; 2, 3, I, 4; 
ot. 2, 4; 4,3; 2, 1. 

Also, in Chapter VIII. it has been proved that there exists 

a contravariant six-vector R** reciprocal to Ay, such that 
A,, = qk" 

and five similar equations. 

Introducing these into (10), we easily prove that 

q~ *0(gR*)/d%q 

is a contravariant vector which we can denote by the name 
“ Lorentzian of RA,” 

This last result can, as a matter of fact, be proved directly 


without any appeal to reciprocity. 
Thus let A* be a contravariant six-vector, then 


(AA#), = dArM/dx, + fav, AJA + {av, w}Ar 


is a mixed tensor of the third order. By contraction we obtain 
a contravariant vector (A*8)g which is equal to 


VAM/dxg + {aB, AB + {aB, B}Am, 


Owing to the antisymmetry of A™* and the equality of 
{uv, A} and {vp, A}, the second term, when summed for the a 
and f affixes, vanishes. Hence 


(A™)g = 0A*8/dxg + {aB, BJA 
= A /dx%g + g—1A42Dq/d%q. 
Or, altering the dummy index, we see that 
q—10(gA*)/d%q - F : cal ( EE) 


is a contravariant vector, being, in fact, the contracted covariant 
derivative (A*), of the six-vector A™. 

There still remains an analogous operation for a mixed 
tensor of the second order, and in this case it is not of necessity 
antisymmetric. Thus 


(A)4), = dA yh/d%y + {ay, pA * — {rv, a}A gf 


is a mixed tensor of the third order. 
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By contraction we obtain a covariant vector (A )*)g which is 
dA 8/dx— + {aB, B}A\* — (AB, a}. oP 
or 0A 8/dxg + g—1Ay22q/d%— — {AB, a}A.8. 


After an obvious adaptation of the dummy indexes we have 
as the contracted covariant derivative of A) the vector 


(A%)q = 7 10(GAy*)d%q — fAa, BJAp™ - (12) 


We can express the last term in this result somewhat 
differently by employing the associated covariant or contra- 
variant tensor ; for 


tha, B}Ap* = g87[Na, y]A p* 


== [Aa yj} AS7: 

Hence 

(Ay*)a = 97 10(GAy%)/d%_ — [Aa, BJA... (12A) 
Also, 

(Aa, B]A*? = gargr? Ays[ra, B] 
= 3A yagy7g?(dgng/d%—_ — Waal dx + Wap/d%a) 
= $A,9g7%g8?de,8/d%x, (since the other terms cancel in 
the summation) 

Hence 
(Ay)a = 97 *0(GAg*)/O%a + PA apmgR/OX, = (TB) 


Collecting these latter results, we see that there exist 
functions which we can call: 

(x) The six-vector Curl or Rotation of a covariant vector. 

(2) The invariant scalar Divergence of a contravariant 
vector. 

(3) The contravariant vector which is the Lorentzian of a 
contravariant six-vector, or of the reciprocal of a covariant 
six-vector. : 

(4) The covariant vector which is the Lorentzian of a mixed 
tensor of the second order. 


THE ‘‘ RIEMANN-CHRISTOFFEL ”’ TENSOR. 


Having thus established the method for the covariant 
derivation of tensors from tensors, it seems very natural to 
apply the process itself to the fundamental tensor g,,. The 
result, however, is disappointing, for little trouble is required 
to show that (ga,)y is zero. 
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Yet it is of vital importance for the development of the 
analysis of a four-dimensional continuum to obtain a tensor 
derived by differentiation from the fundamental tensor. The 
following method arrives at such a tensor called after the two 
men who independently discovered it. 

Suppose we apply the method of covariant derivation twice 
to any arbitrary covariant vector. The first derivation gives 


(Ay)u = 04)/d%, — fr, B}Ag, 
and the second derivation gives 


((Ay)u)» = (Ay) u/d%, — fav, af(Aa)u — {ur a}(Ap)a 
d(04y/d%4 — tru, BJA g)/dxy 
— iu, a}(2Aq/% — fay, BYAp) 
= {py, a}(04)/d¥_ — tra, B}Ag) 
= 097A y/d% pdx, 
— {Au, B}dIAg/dx, — fArv, a}dA,/d%, — {uv, a}dA)/d%q 
+ AaQv, a}{am, B} + Ap{uv, a}{Qa, B} 
— Agd{ru, B}/d*,. 

If we interchange the order of derivation we can write 
down in a similar way the expression for ((A)),),, and it is 
easily seen that the first four terms and the sixth in ((A)),), 
make their appearance in ((A)),),. As the difference of these 
derivatives is a tensor, it appears that 


(d{Av, B}/d4%, — d{Au, Bj/d%» + fry, as{ap, B} — fru, ajar, B})Ap 


is a covariant tensor of the third order. It is obviously a 
contracted product of the covariant vector A, and a mixed 
tensor of the fourth order, viz., 


d{Av, KY dX — fru, K}/dx,y 4 {Av, a}{ap, x} — {Ap, al{ar, x} (13) 


which we denote by Ra,,* since it is contravariant as regards 
the symbol x. It is obviously a function of the g,, functions 
and of their first and second differential coefficients with respect 
to the co-ordinates. 

We can also derive a covariant tensor Raw of the fourth 
order by association. It is gxgHtay*, and is equal to 


Pre dee) ori = 2p, Baek Qo allen, A) — Ow, afer, 6) 
= 0(gna{Av, B}) [2% — fAv, B}2gp/d%, 
pe (enptAy, B})/d%, + ru, Bhd8up/ ity 
ein ae atone 
== d[Av, «]/d%p. — be ee ) — Om, XC para 
+ frv, a}([ap, K] — Weal dX) — {Ap, af(lav, kK] — nal! o%y 
= ue aye — dfAp, «]/d% + {Ap, a}[kv, a]— {Av, a}[Ku,a] (14) 


I 
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The covariant tensor (14) is sometimes denoted by the 
symbol (xAuv), called a “ four-index ’’ symbol of Christoffel. 
A little investigation will show that 
pie) Geni eee ae 
(KApr) = (Axvpe) = (ured) = (ypAr) 
= — (Acpr) = — (Ave) = — (ypxd) = — (pvc), 


i.e., interchanging « and A or pw and v alters the sign, and dis- 
placing «A to the position of uy leaves the value unaltered. 
Further, 


(KAuv) + (KpvA) + (KvAn) = 0 


(which gives a relation when three of the index signs are per- 
muted and one is not) is a result which can be easily proved by 
writing out in full, when it is seen that the terms cancel in pairs. 

These equalities show that there are only twenty numerically 
different components which are not zero. Thus: 


Those of type (kkk) are all identically zero. 
= . (KKKA) - ce 
Pe . (KKAd) a ee 
” ” (KKAw) ” ” 

There remain 6 of the type (KAAk) or Rare 
Fe * ia. yy (Aux) OF Rayne 
- - 7 eae ys (KApY)- On Tea 


i.e., twenty in all which are not zero and are in general numeri- 
cally different. 

Similar conclusions hold for the components of the tensor 
LS 


THE LAw oF MOTION OF A PARTICLE. 


The object of the development of this tensor analysis for 
a “‘non-homaloidal”’ space-time is to obtain a mathematical 
symbolism suitable for the formulation of laws of nature which 
will pass the Relativity test in any frame of reference. The test 
will be satisfied if a law can be expressed as an equality between 
tensors, or as the equality of some linear function of tensors to 
Zero. 

It is very natural to choose (1) as the law of motion of a 
particle of matter “ under no forces,”’ because, in the first place, 


Xy + {aB, \}ia%~ 
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is a contravariant vector,* and, secondly, it is the law of motion 
in a Galilean frame, for in that case, since &an is zero, the 
equations reduce to 

%) = O, 


which we know to be the law of motion from the investigations 
of the earlier theory of Relativity. 

If we choose a frame of reference which is free from gravita- 
tion, the path of a particle ‘‘ under no forces,” i.e., unaffected 
by pressures, pulls, electromagnetic forces, is determined by 
%, = 0. The path of the same particle, still unaffected by such 
forces, in another frame is %, + {a, A}%,%g = 0, where the 
&a. functions which occur in the coefficient {a8, A} are deter- 
mined by the law of transformation of co-ordinates from the 
one frame to the other. This path is, of course, curved and 
independent of the nature of the particle ; it has all the features 
of a path in a field of gravitation ; in this case we call the field 
“‘ fictitious ”’ or “‘ geometrical’ or “‘ non-permanent.”’ Einstein 
assumes that this is also the equation of motion of a particle 
“under no forces ” in an actual or “‘ permanent ”’ gravitational 
field. The assumption is based on the Principle of Equivalence, 
and satisfies the general Principle of Relativity. He can still 
refer to the particle as “under no forces”’ (meaning forces 
arising from material contact or electromagnetic action), 
because he has by this procedure removed gravitation from the 
category of force and made its manifestations arise from the 
_ metrical relations of space-time, i.e., the expression of the 
invariant separation between two point-instants in terms of the 
particular system of co-ordinates chosen. In fact, from the 
manner in which equation (1) was derived, we can say that the’ 
natural path of a particle in a gravitational field is that which 
makes the separation between two given point-instants of its 
history the greatest possible. The world-line of the particle is 
a geodesic. In a region free from permanent gravitation, the 
natural path between two points is the shortest possible, so 
that given the time of passage, the separation is a maximum, for 


5s2 = dt2 — bx? — by? —- 82? 
or ds = Ot(1 — v3, 


* t, + {aB, Hap = %a()%a/d%. + {8a, A}¥g), and the expression in 
the brackets ( ) is a component of the covariant derivative of the 
contravariant vector 7,. Hence the right-hand side is the inner product 
of a contravariant vector and a mixed tensor of the second order, and so 
is itself a contravariant vector. 
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and so, given 8, 5s is a maximum when v is smallest, which is 
the case for the straight path of shortest length. On transform- 
ing co-ordinates, the curved path in the new space frame of 
reference retains the same character, since 6s between two 
definite events is invariant, and if it is maximum for a definite 
sequence of events in one frame, it remains so in any other. 

It is very necessary to go warily at this point. It is a 
matter of pure mathematics only to say that (1) is the equation 
of motion of a particle in a geometrical gravitational field such 
as we derive by transferring ourselves from a Galilean frame 
to one in variable motion relative to the former ; at least, it is 
so if we admit that x, = 0 is the law of motion in the Galilean 
frame. But we invoke the Principle of Equivalence when we 
say that (1) is the equation of motion in a permanent field of 
gravitation ; for sucha field cannot be removed by a transforma- 
tion of axes in the way in which a geometrical field can be 
removed. 


EINSTEIN’S LAW OF GRAVITATION. 


This brings us at once to the closer consideration of those 
characteristics which mark off permanent and natural fields of 
gravitation as a special class from all conceivable fields which 
might be introduced by all conceivable modes of co-ordinate 
transformation. Such characteristics must be expressible in 
the form of equations satisfied by the ga, coefficients, equations 
which indicate the nature of the changes taking place in these 
coefficients continuously as a point-instant changesin space-time, 
i.e., differential equations. Such equations abandon all notions 
of “‘action at a distance,’ and determine the motion of a 
particle at a given position and instant, not by an appeal to a 
distant body, but to the metrical conditions of the portion of 
space-time in which this event is located, these conditions being 
related step by step to conditions round all other point-instants, 
and ultimately to those regions of space-time which contain 
the world-lines of all particles of gravitating matter. 

We now perceive why it is so important to obtain tensors 
which involve the differential coefficients of the g,,-functions. 
If the law of gravitation is to be subject to the Principle of 
Relativity it must involve such tensors. We have obtained 
one such tensor, the ‘“‘ Riemann-Christoffel,’’ and any other 
tensors which we can derive from it by the methods of con- 
traction and association. Our first impulse is to try the law 


xg =0 é : C . (15) 
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which, as has been indicated, is really a collection of twenty 
independent equations.. Unfortunately, this law, although true 
in any gravitational field which can be derived by a mathema- 
tical transformation from a Galilean frame, i.e., in a geometric 
field, cannot be true in a natural field. To see this, we have 
only to note that (15) is certainly true in a Galilean frame, 
because there the gy,-coefficients are constants ; and since after 
a transformation the tensor components of R,,,* in the new 
co-ordinates are linear functions of the components in the old, 
(15) must still be true in the new co-ordinates. Hence (15) is 
a necessary condition in any frame of reference if the gravita- 
tional field in it can be removed everywhere by a suitable trans- 
formation of co-ordinates. It can also be proved that (15) is 
a sufficient condition for the possibility of such a removal. But 
it is an outstanding feature of actual gravitational fields that 
they cannot be removed everywhere by a suitable choice of 
axes. Fix the axes on a small piece of matter moving naturally 
through the field, and gravitation is removed in the immediate 
neighbourhood, but not at places at any moderate distance 
from the origin. Consequently equation (15) is too stringent 
a condition to impose upon the gy,-coefficients. Linstein, 
realising this, and bearing in mind that any more general 
equation which would be satisfied both in permanent and 
non-permanent fields must include (15) as a particular case, 
has suggested instead of (15) the equations 


Dae ——} 0) . . ° . (16) 
i.€., Rygat + Rapa + Rags® + Raga* = 0. 


These are clearly satisfied if the individual R,,,* components 
are zero, but they are also consistent with other than non-per- 
manent fields. As the process of contraction is involved, the 
expression equated to zero is a covariant tensor of the second 
order, and so the equation (16) is covariant and satisfies the 
Principle of Relativity. This contracted Riemann-Christoffel 
tensor is usually denoted by the symbol G,,, and thus Einstein’s 
law of gravitation for points where there is no matter (or 
electromagnetic energy) is 


Cie OL pe Wales, (LOA) 


Since Gy, is a symmetric tensor, on account of the symmetry 
of gy, there are ten such equations, but it will appear later that 
there are four identical relations between the G,, components, 
which therefore reduces the number of independent equations 
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in (16A) to six. It is clear that the equations involve the 
differential coefficients of the ga,-components up to the second 
order. 

Before proceeding further it will be convenient to write 
them more explicitly. 

By (13) (altering the-dummy suffix), 


Rruat = 0 Aa, a}/dXp — d{A : a}/d%q + {Aa,, BY B, a} 
, =D, BHA, 0}. 


But by (8), 
{fAa, a} = d log g/dx). 
Therefore 
Gray = — dfAu, a}/d%q + fra, BHwB, a} 
+ d? log g/d%)d%, — {Au, a}d log g/dx%q - hE) 


We should expect that as Newtonian theory gives such a 
close approximation between observation and calculation, the 
equations (16) would degenerate under suitable conditions to 
the equations of Newtonian theory, and we shall deal with this 
matter at once. Schwarzschild’s method of obtaining an exact 
solution of Einstein’s equations corresponding to the field of a 
single body such as the sun will be given later ; but so as not 
to interrupt the general development of the theory, we shall, for 
the time being, make use of the following conclusions, which 
can be obtained from an inspection of the solution. (See 
Chapter XIII.) Regarding x,, x, x; as Cartesian co-ordinates 
referred to axes with the Sun as origin and x, as time, gy, differs 
from plus or minus unity by quantities of the order M/s, if 
A =p, where M is the gravitational mass of the sun; the 
gravitating effect of the planets will not introduce any differ- 
ences except of a lower order of magnitude. At the orbit of 
Mercury M/r is of the order 10~7. If A4= and neither A nor 
fe is 4, 2a, is actually of the order M/r. If A= 4, gy, may be 
treated as zero. They are zero in Schwarzschild’s and Einstein’s 
solutions, since these solutions refer to a static field. Through 
the motion of the planets, however, in the frame of reference, 
the field will alter at each point by relatively small amounts 
with lapse of time. In any case, gy4 is not only small, but it 
is also involved in terms of the equations which are of a rela- 
tively low order of magnitude, as the reader can verify very 
easily for himself at a later stage. It has also to be observed 
that g is equal to — I, or approximately so. As regards the 
differential coefficients, 
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d8ru/d%, is of the order M/r? if A, w, v + 4 


dW a4/ dX, ” ” »” v = = 4 
dWau/dx, is of a lower order or zero if y= 4 and A orp = 1, 2, 3,4 


= " . - ‘ fF j= 4andAory + 4. 


From these can be worked out the order of magnitude of 
the [ ] symbols, and then of the { } symbols. 


{Au, v} is of the order M/r? if X, pw, v + 4 


44 Py xs “ » v4 

{44,4} ,, : A= 4 

£4, v} is of a lower order : or zero if Aor vy = 4 
fu, 4}, E of ifAorp+#4 
{44 i 22 . » 


In the differential equations of motion for a particle, viz., 
(1), it will appear in a moment that dx,/ds is of the order 
unity in this approximation, and so dx,/ds, dx,/ds, dx,/ds are 
comparable with the components of a particle’s velocity in the 
field, and so are of the order (M/r)? in general. 

In the fourth of equations (1) we can neglect those of the 
terms {a8, 4} in which both a and f are not 4. Hence 


%, + 2{a4, 4}4,%4 = O. 


{a4, 4} = g*8[a4, B] 
= g*4[a4, 4] 


Now 


OK AS. aXgaS 


and hence d?x,/ds* 
— g4d¢,,/ds . dx,/ds. 


l ll Ih 
| 
OQ 
~ 
a 
OR 
~ 
= 
y 
8 


To this order of approximation 


git = Tae 
Hence 
ax 4[ds* Ag 4,/as 
ax 4/ds 844 
or log (dx,/ds) + log g4, = constant, 7 
Le. dt/ds or dx,/ds = constant/g 44. 


This (in general) approximate equation is actually exact in 
the case of Schwarzschild’s solution (see equation (2), Appendix 
to Chapter XIII.). Since 1 — g4, approaches zero as we recede 
from gravitating matter, and as 5¢ approaches 6s under similar 
circumstances, the constant is unity. Hence approximately 


di/ds = I/2 44. 
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Near gravitating matter, therefore, 5¢ > ds (since g4, = I — M/r), 
or the time of an occurrence in a gravitational field is greater 
than its proper time, which is the general statement covering 
the displacement towards the red of the solar lines. 

The first three of equations (I) can be thrown into a form 
involving d/dt and d?/dt? as follows. (We use ¢ as alternative 
tow,) Put dx,/ds — 6, we optam 


B2dxy2/dt2 + BdB/dt . dx,/dt + {aB, rA}B2dx,/dt . dxg/dt = o. 
The fourth equation is 


BdB/dt + {a8, 4}B%dx,/dt . dxg/dt = o.* 
Hence 


d?x/dt? = — ({aB, A} — {aB, 4})dx,/dt . dxg/dt 


are the three equations of motion (A = I, 2, 3 in succession). 
As a first approximation we only keep terms of the order 
M/r? on the right-hand side, and we find 


d?xy[dt® = — {44, 0}. 


This is so since dx)/dt is of the order (M/r)# for A = I, 2, or 3, 
but of the order unity if A = 4. 
Now 
{44, A} = er[44, a] 
= g[44, A] (not summed) approximately 


= — [44, A] 
= — $28 44/dX). 
Hence 
d?x/dt? = d6/d* 
if fa = 1 — 20: 


which emphasises a result, already known in special cases of 
“geometric ’’ fields, that g4, differs from unity by twice a 
Newtonian potential function. 

Turning now to the first order approximations to Einstein’s 
law of gravitation, let us consider the equation 


Gy, = 0; 
or 
d{44, a}/dxa — {4a, B}{4B, a} = 0° log g/dx,? — {44, a}dlog g/d%,. 
The first term on the left-hand side involves terms of the 
order M/r’ at the highest, i.e., of the order M/r . 1/72 ; the second 


* 8 = dx,/ds is not, of course, to be confused with the @ in the index 
symbols. 
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term involves terms of the order M?/r4 or (M/r)?. 1/72, and 
so can be neglected in comparison with the first term. On the 
right-hand side both terms involve no terms of lower magnitude 
than M?/r4. Thus the equation reduces to 


d{44, a}/d%_ = 0; 
or (d7/d% 4? + d2/dx0? + d2/d%52)e44 = 0, 


which, in view of the interpretation of g,, above, is Laplace’s 
equation for the Newtonian potential function. 

It will be as well also to refer to what has been called the 
“ official attitude ” of the relativist towards co-ordinates. 

We have already seen what the procedure is in a number of 
cases when dealing with Restricted Relativity, and it is essen- 
tially no different in General Relativity. A law of nature is 
submitted as true; if it be cast in the covariant mould, i.e., if 
its form be such that it is obviously a set of linear equations 
between linear tensors, all is well ; if this be not so, the equation 
must be generalised so that it takes on this desirable property 
and degenerates to the original form in the special frame of 
reference for which that form is known to be “ true to nature.”’ 
The great strength of the Relativity standpoint is due to its 
success in divesting all the fundamental laws of Physics of 
those accidental features which appear in their expressions in 
terms of some particular type of co-ordinates, chosen in some 
special frame of reference. By the very nature of tensor analy- 
sis in a four-dimensional continuum, the equations are quite 
symmetrical in the four variables %,, %2, %3, ¥4. There is abso- 
lutely nothing in the equations themselves to indicate that any 
particular variable is “time,” and that the other three are 
“space ”’ co-ordinates ; or, again, that three of the co-ordinates 
are Cartesian, polar, cylindrical, ellipsoidal, etc. We have been 
calling x, ‘‘time’”’ so far; but that was merely a formal 
convenience. Wecould have called %, or x, or x; “ time,” and 
chosen x, as a ‘“‘space”’ co-ordinate. Of course, this arises 
from the unification of space and time which distinguishes 
Relativity theory, and is analogous to the same lack of definite- 
ness in the older forms of physical laws as regards Cartesian 
co-ordinates, where the interchanging of x, y, and z produces no 
contradiction or error. In fact, %1, %2, %3, ¥4 are simply any 
tetrad of quantities which uniquely define an event. Our 
method of selecting such tetrads is a matter of pure con- 
venience. When we believed in the absolute nature of space, 
we attributed a definite form to a body for all observers ; no 
matter what kind of co-ordinates they used to define the points 
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on its surface, it preserved this invariable form which was 
independent of the co-ordinate system chosen. We no longer 
accept this, but we now ascribe this invariant property to the 
world-lines of matter; these have a definite form, and are 
quite independent of our choice of co-ordinates. The inter- 
sections of these world-lines are the coincidences in observation 
which form almost the entire store of material which we collect 
and classify, in order to discover sequences in phenomena and 
to frame laws which summarise such sequences, and the succes- 
sion and order of such intersections is entirely independent of 
any numerical method which we adopt so as to distinguish 
them from one another. We are familiar with so-called 
‘curvilinear ”’ co-ordinates in three-dimensional space. Three 
families of surfaces are chosen, such that every point in space 
lies on one and only one member of each family ; each member 
of a family is numbered, and the co-ordinates of a point are the 
three numbers belonging respectively to the member of the 
first, of the second, and of the third family on which it lies. 
Similarly in a four-dimensional continuum we can choose four 
families of three-dimensional continua, so that every “ event ”’ 
is numbered by the four members, one from each family, to 
which it is common. 

But this breadth of view, although so comprehensive when 
expression of natural laws is in question, leads to a little per- 
plexity and doubt when we are dealing with measurements and 
observations of definite phenomena in a definite frame. The 
relativist cannot “ officially’ start off by saying: ‘‘ Choose 
Cartesian co-ordinates, or polar co-ordinates, etc.’”’ He knows 
that in doing so he is involving himself in difficulties similar to 
those which would meet a hypothetical two-dimensional being 
who, while living in a universe that was a surface curved in 
three dimensions, nevertheless assumed his universe to have the 
metrical properties of a Euclidean plane. What he must do is 
to start off with a set of coefficients or ‘‘ potentials,” with which 
to express the square of an element of a world-line. Any par- 
ticular set of ten analytical expressions which he chooses is a 
legitimate set if it obeys the “ law of gravitation,” i.e., satisfies 
some tensor law involving the differential coefficients of the 
&ay-functions. At present Einstein’s law—equation (16)— 
holds the field. Presumably, if the ten expressions are solu- 
tions of it, they possess that particular property which they 
must have in order to suit the physical nature of our space-time 
continuum, a property which is quite independent of special 
frames of reference or systems of co-ordinates. Now although 
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the Einstein equations (16) are formally symmetrical, it does 
not follow that the individual expressions for 21, 212, 213, etc., 
in a legitimate set are symmetrical in the co-ordinates, i.e., that 
&12, for example, involves the co-ordinates x, and x, in the 
same way as g3, involves the co-ordinates x, and x,, etc. 
Bearing this in mind, suppose we have discovered an exact 
solution, or, at least, a sufficiently approximate one. The 
question now arises: What do %,, %», %3, ¥, ‘‘ mean ”’ in these 
particular expressions for ga,? Here we turn gratefully to 
Euclid and Newton, for we probably notice that our special 
expression for 6s? tends towards some expression with which 
we are already familiar in the restricted theory, when the x) 
approach some limiting values. Perhaps, for example, 8s? 
approaches 
8%,2 — 8% 2 — 5%,2 — dx,? 


aS %,, %,, x,» approach infinite values. We at once jump to the 
conclusion that x, is “‘ time,’ and x), x,, x, are “ rectangular 
Cartesian co-ordinates.”’ Or it may be that our expression 
bears a reasonable resemblance to 


8x2 — 5%)? — %)76% 2 — %)? sin? %,,0%,7, 


and we assume that %,, %), %», %, are “time’”’ and “ polar 
co-ordinates.” But, although such guesses are extremely ser- 
viceable, there is still a small element of doubt in the situation, 
unless our experimental appliances can be refined beyond the 
degree of precision as yet attained. This is a point we shall 
return to when obtaining solutions of the equations. 

Another important detail which needs emphasising is con- 
cerned with a limitation of the Principle of Equivalence which 
has been tacitly introduced. The Principle apparently states 
that the laws of nature which hold in a non-permanent field of 
force (i.e., in a frame accelerated with reference to a frame 
which is Galilean over a finite region) will also hold in a 
permanent field. This statement is certainly invoked when 
equation (1), which is true in a non-permanent field, is assumed 
to be the law of motion in a permanent field. But it is denied 
when equation (15), which holds in a non-permanent field, is 
assumed not to be true in a permanent, and is replaced by (16). 
In short, the law of gravitation is a law which limits the prin- 
ciple of equivalence and restricts a statement which as a mere 
generality would be useless, since without a method of deter- 
mining what we may call “legitimate” or “ natural ”’ poten- 


tials, we could not effect a solution of any gravitational problem, 
Tat 
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however true in the abstract the Principle of Equivalence might 
be. Eddington stresses this point in his “‘ Report,’’ and enun- 
ciates the Principle in a strict form. We cannot do better than 
quote him. In Chapter II. (p. 20) he says: “ For an infini- 
tesimal region the gravitational force and the force due ta a 
transformation correspond ; but we cannot find any transforma- 
tion which will remove the gravitational field throughout a 
finite region. It is like trying to paste a flat sheet of paper on 
a sphere—the paper can be applied at any point, but as you 
go away from the point you soon come to a misfit. For this 
reason it will be desirable to define the exact scope of the 
Principle of Equivalence. . . . The necessary limitation turns on 
the number of consecutive points for which gravitational space- 
time agrees with homaloidal space-time ; in other words, the 
equivalence will hold only up to a certain order of differential 
coefficients ;’’ and later, in Chapter IV. (p. 42): “ The 
difference between a permanent gravitational field and an 
artificial one . . . is that in the latter case equation (24. 1) 
[our equation (16)] is satisfied, whereas in the former the less 
stringent condition (26. I) [our equation (15)] is satisfied. 
These equations determine the second differential coefficients 
of the g,,, so that we can make the natural and artificial fields 
correspond as far as first differential coefficients, but not in the 
second differential coefficients. We shall therefore state the 
Principle of Equivalence as follows : 

“All laws relating to phenomena in a geometrical field of 
force, which depend on the gy, and their first derivatives, will also 
hold in a permanent gravitational field. Laws which depend 
on the second (or higher) derivatives of the ga, will not neces- 
sarily apply.” 

When stated thus the principle covers the adoption of (1) 
as the equations of motion, for they do not contain second 
differential coefficients of the potentials. 

A considerable simplification of the equations (16) can be 
introduced if, as Einstein suggests, the co-ordinates are so 
chosen that g = — 1. This condition is satisfied in the special 
Relativity theory ; but, of course, the adoption of it here as 
an a posteriori condition involves no renunciation of the general 
standpoint, for the condition does not of necessity involve 
constant values for the ga,-functions. Thus we might find a 
set of ga, which satisfy (16) but do not make g equal to — 1. 
Now by any mathematical transformation we can find another 
set of g,,. These would still satisfy Einstein’s law, for since 
Ga, is a covariant tensor, 
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Gay = bradusGap, 


and so, if Ga, = 0, it follows that Gy,’ must also be zero. 
Moreover, since 


‘ & = 10 e. 
it might be possible to choose the transformation, and hence 
the transformation determinant |}|, so that |b?|e = — 1, and 
thus satisfy the added condition. In general equation (16) is 
Pru + Dru = O, 
where 
- Pau = — dfAy, a}/d%—a + fra, B}{wB,a} . (18) 
an 
Pau = log g/dx) dx, — {Au, a}d log g/dxq. 22 (EG) 
With the condition g = — 1 the equation becomes 
Pu O0 =. : ; = (20) 


It is very necessary, however, to bear in mind that although 
we have divided the tensor G,, into two parts, these parts in 
general are not individually tensors, and so as to make this 
point explicit, we have not indicated either part by capital 
letters. Ifg = — I, da, is atensor, but not otherwise. A very 
good illustration can be given here of the many pitfalls which 
wait for the unwary and into which it is only too easy to tumble 
at a first reading, for, as Eddington remarks, there is “no 
royal road to Relativity.”” Seeing that y%, is the covariant 
derivative of 3 log g/dx,, which should be a covariant vector, 
since it is the gradient of a scalar function, why is y%,, not a 
covariant tensor ; and so why should the remaining part, ?y,, 
not be a tensor also, since the whole is a tensor? Now it is 
true that if 

O(% 4, Xa, Xa, Xa) = X(¥1', Xa’, Xg', Xa); 


then 290@/d%, is a vector whose components in the accented 
co-ordinates are dy/d%)’. But, in general, g is not equal to g’, 
and so log g is not in general equal to log q’. In consequence, 
the gradient of log g in unaccented co-ordinates is not the same 
vector as the gradient of log g’ in accented co-ordinates, and 
the fallacy in the reasoning above is obvious. 

The particular choice which Einstein has made for his law 
of gravitation has an element of arbitrariness in it. This point 
will be discussed later, but it can be stated now that no other 
tensor equation can be obtained which involves a tensor of the 
_second rank and contains only first and second derivatives of 
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the potentials, and the second derivatives in the first degree. 
Other suggestions have been made, such as 


ert = 0, 


but it can be shown that this leads to a set of equations equiva- 
lent to (16). At present Einstein’s law of gravitation appears 
to be the simplest possible assumption in this tensor analysis, 
and so far its conclusions agree so well with observation as 
not to necessitate any resort to a more complicated tensor law. 


CHAPTER X. 
ELECTROMAGNETIC EQUATIONS IN A GRAVITATIONAL FIELD. 


FRom this discussion of Einstein’s law of gravitation and the 
limitation thus imposed on the Principle of Equivalence, we 
now return to the further development of the tensor analysis 
which is required for the application of the general Principle of 
Relativity to other natural phenomena. 

In Einstein’s theory inertia and gravitation are fused into 
one concept by means of the ten potentials. In Newtonian 
theory an absolute frame “free from force ’’ is postulated ; in 
this frame any body travels uniformly in a straight line, or 
pursues the shortest path ; this is due to its inertia. True, it 
may be deflected from this path by the mechanical or electro- 
magnetic action of other bodies in the frame, but such deflection 
is not inherent in the postulated frame. In real frames deflec- 
tions inherent in the frame are actually observed. These are 
attributed to a special cause, gravitational force; and so 
inertia and gravitation are set over against one another as 
universal but yet distinct properties of matter. In Einstein’s 
theory they are not so contrasted. In any frame real or hypo- 
thetical, all bodies pursue naturally the path of maximum 
separation. This may be curved or not, according to the nature 
of the frame. Its form is determined for all bodies by the 
values of the g,,-functions at each point in the frame. Thus 
Einstein is not compelled to introduce a “ gravitational force ; ”’ 
both gravitation and inertia are contained in the values of the 
&ay-functions, and the equations of motion 


d?x)/ds? + {aB, A}dx,/ds .dxg/ds =o . Smee) 


require no terms on the right-hand side. 

Yet we know that, even in the “ inertial frame ”’ of Newton, 
bodies free from gravitation would not travel straight if they 
affected each other mechanically or electromagnetically ; 
neither, we may surmise, will they pursue the path of maximum 
separation in Einstein’s theory in similar circumstances. We 
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may naturally ask what becomes of the equation of motion in 
this case. After the course adopted in Chapter VI. there is no 
difficulty in answering this question. We have seen that the 
left-hand side of (1) isa contravariant four-vector. Consequently 
the general equation of motion for a particle in any frame, when 
affected by non-gravitational influences, is 


hy -+{aB,\Mote= PA . . . @) 


where PA is a contravariant vector, whose form depends on 
those influences, and which we would call the ‘‘ force-vector.”’ 
If we proceed as we did in earlier chapters, we will endeavour 
to show that at all events electromagnetic forces and their 
activity are capable of being brought into the form of a contra- 
variant four-vector; but before we can do so, we shall have 
to discuss the question of the general Relativity of the equations 
of the electromagnetic field, and so we shall apply ourselves at 
once to the study of that problem. 

It is but natural to turn back to earlier chapters once more 
for guidance, and we find it to hand. In Chapter VI. we saw 
that in the restricted theory the expression of the field equations 
in covariant form was obtained by means of four- and six- 
vectors, reciprocal six-vectors, and the Lorentzian operator. 
In Chapters VIII. and IX. we find that these tensors and this 
operator have their analogues in non-homaloidal continua. 
Our course is therefore quite clear.* 

The field equations were written down in Chapter VII. in 
the form 

OL jo 0X =) (4 equations) 
and dF yy/d%, + Wya/dX%p_ + IF ay/d%, = O (4 equations) ; 


or the second tetrad could be expressed as 
WRjal 0% = 0; 


where 2, is reciprocal to Fy,. 

The tensors and vectors involved the components of field 
and current. One simplifying feature is now absent. We 
must take account of covariance and coniravariance. For 
example, we can easily see that if we wish to create a current 
density vector it will be a contravariant vector, and so we shall 
have to use the symbol J* for it and not Jy. It will be, in fact, 
the vector podx,/ds, where py will be an invariant quantity, 


* The use of x, for imaginary time in Chapters VI. and VII. was. 
only a matter of convenience, and involves no theoretical change, 
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viz., the density of charge measured in a special frame which 
will be identified precisely a little later. Recalling the result 
(10) of Chapter IX., we satisfy the Principle of Relativity if we 
can throw the equations of the field into the form 


Wyy/dX + IRya/ Xp - IRay/dXy = gy] 5 é (3) 
and OF yy /d%\ + Wya/d¥y + IP ry/d%, = O : eta) 


where Fy, and Ray are covariant six-vectors or “ field-tensors.”’ 
Equation (3), of course, involves 


d(gJ*)/d%a = 0, 
the “ equation of continuity.” 
Taking the fourth component of J*, we see that 


qJ* = pdx /ds. 


Let us denote this by p. Now consider the group of frames 
of reference which are Galilean for a given point, and especi- 
ally that one in which the point is momentarily at rest. If 
(Yzy V2, V3, V4) are the co-ordinates in this frame of the event 
(%1, %2, %3, %,4), then by a result in Chapter VIII. (p. 182) 


99% 18% 98% 59% 4 = SY OV dV 30) 4 
and hence poG4%4/ds . 8x ,5%.5%3 = pody,/ds . dy by yz, 
Le, PO% 10% 0X3 = pydV OV 20s, 
since dy,/ds = 1, for we have chosen a Galilean frame in which 
the element of volume is at rest. Hence, if we identify pg as 
the ‘“‘ proper’ charge-density at a point, p will be the measure 
of the charge-density in the gravitational frame where 
p = 9Bpo(B = ax,/ds), and is the generalised Lorentz factor 
for volume element moving in the gravitational frame. In 
fact, since d%x,/dx,, d%_/dx,, dx,/dx, are the velocity components 
Vy, V2, V3, 


Be (20s 28. s+ efi +: foes 


This measure of p ensures the tmvariance of the charge on 
a body. Moreover, since 


gJ = qBpot%1/dx4 = pry, etc., 


the right-hand sides of (3) become the three components of 
current-density at a point, and the charge-density at a point as 
measured in the gravitational frame. 

The next step identifies the components of F’,, and Ry, with 
_ the components of the field (three-) vectors. In the general 
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theory of the field for material media, we have four such 
vectors, the inductions and the intensities, the equations taking 
the form 


dd/dt + pv = curl zs 
divd =p ; ; - (5) 

— db/d¢ = curl ° 
divb = 0 ; : oan) 


and to obtain solutions, algebraic relations between b, d, h, e 
have to be postulated, usually simple linear relations. 

It will be found very convenient to take this form of the 
equations in a gravitational field even m vacuo. It is then 
easy to identify (5) with (3) and (6) with (4) if the components 
of the covariant field tensors (antisymmetric, of course) are 
assumed to be 


B= 1 w= 2 [Be 5) Se, 
Fy.= 0 — b, by — ex A=I 
b, re) — b, — ey A=2 
— by Dy O — é, A = 3 
ex ey ez 9) Ad, 
Raa = O — d, d, hy A=1 
d, O — dy, hy A= 2 
—d, dy O UPS es 
—h, —hy —h, o* | A=4 


where we have introduced x, y, z, ¢ for co-ordinates and time 
in the gravitational frame, considering them to be alternatives 
to %1, Xo, X%3, X, aS a matter of convenience. 

In casting about for “constitutive’”’ relations between 
b, d, e, h as a necessary step towards solution, we must choose 
them so as to satisfy Relativity and be true to nature in any 
frame where we can make a test. Referring to Chapter VIII., 
we pointed out the fact that tensors of different types could be 
related by “‘ association ’’ with one another. Now F aw and Ray 
are both covariant tensors, and could not be so associated ; 
but we can form contravariant tensors from them by “ recipro- 
cation.” E.g., if PF is the reciprocal of Ry, we know from 
Chapter X. that 


* It is very necessary to bear in mind the remark immediately after 
equation (10) of Chapter IX., concerning the replacement of x, \, fy v 
by actual digits. 
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gh == 1@) h, oe hy <= de 
hy 7 hy 0 aoa d, 

dz dy d; O 

and if R* is reciprocal to Fy, 

gRM@—= 0 — é, by — 06, 

e, O a Cx aod by 

— ty ex ) — 6b, 
b, by b, O : 


Let us now postulate that R* is the contravariant tensor 
associated with Ra,, so that 


Rr = graprBRag . ; ; 5 7) 


This is a set of twelve linear relations between the twelve 
field vector components. It satisfies Relativity, and it de- 
generates to the correct relation in a non-gravitational frame. 
For in that case £1, = £20 = £2 = — 1,24, = 1, the others 
being zero ; also g = — I, and therefore g!! = g?2 = p33 — — 7, 
g** = 1, and the others are zero. Thus it is easily seen that 
(7) degenerates into the equalities 


a= 6 
b=h 


which leads to the usual well-known forms of the equations in 
vacuo. 

From a result dealt with at the end of Chapter VIII., we 
know that equation (7) is consistent with the equation 


Pr — gragt8F yg, ; : ~o) 


so that F* is minus the associated tensor of Fy, ; and it is easy 
to see that it likewise degenerates to the familiar form in the 
absence of gravitation. 

A very interesting application of (7) and (8) has recently 
been made by Silberstein and Rankine.f Let us assume that 
co-ordinates have been chosen so that g = — I, and assume 
also that the coefficients 214, £24, &34 are zero. It follows that 
git — 94 — p34 — 0. Hence from equation (7) we find that 


* N.B. that the components of these two tensors are not formed 
from the field vector components themselves, but from their quotients 
after division by g or ( — g)?. 

+ «Phil. Mag.,’”’ May, 1920, p. 586. 


220 RELATIVITY 


aa by a Ri — gitg4BRag 
—- g*4*(eUh, — gehy a gi8h,). 


Similarly 
rere by —_— gt (27th, = hy = 2"3h,) 
and — b, = gM(g*4h, + gPhy + g*h;). 
That is, by = Virhx + Vichy = Vighz . ° (9) 
and two similar equations, where the six coefficients yay(= Yya) 
- are functions of the tensor components £33, 212, - - +» &33) &aa- 


In the same way we can show from (8) that 


Ay = Virex + Y120y + 713% . ~ ite) 


and two similar equations. me 
It is known that if we rotate the space axes into the positions 
occupied by the principal axes of the ellipsoid, 


Y11%" + Yo2V? + Y332" + 2yeaVZ + 2y—i2% + 2yioxy = 1 


where y,,, etc., have the values at a point P, then in the neigh- 
bourhood of this point the equations (g) and (10) become 


by = yihx dy, SS Ve ’ 
by = yxhyp (9). dy = Y2ey~ (IO’) 
b, = ALE 4, = V3lz 


the components being now the resolved parts of the vectors 
along the new axes, and yj, y2, ys being the “ principal ” tensor 
components of the three-dimensional tensor yay. Of course, 
Yu Y2 Ys are functions of the ga,-functions. Putting p equal to 
zero in (5) and (6), let 


ey = A, sin p{t + (l% + my + nz)/v} 
and two similar equations, and 
h, = B,sin p{t + (le + my + nz)/v} 


and two similar equations, so that we are considering the pro- 
pagation of a plane wave whose frequency is p/27, whose wave 
normal has the direction — /, — m, — n, and whose velocity is 
v. From equation (5) we get 


vy,4, = mB, — nB, vy,D, = nA, — mA, 
vy,A, = 1B, — 1B, Vy,By = 1A, — nA, 
vy;A, = IB, — mB, vysB, = mA, — lAy. 


From the first, fifth, and sixth of these it follows that 
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v*y,A, = m(mA, — 1A,)/y, — n(lA, — nA,x)/yo, 
Sov yyyaysdz = (myg + n*y3)A, — Umy Ay + ny3A,) 
; = (Py, + my, + Wy3)Ax 
since ly,A, + myAy + ny,Az = 0. 


Hence A, cancels out and leaves 


v2 = (yl? + yam? + ygn*)/yiyeys - ees) 


Exactly the same conclusion would be deduced from any 
other group of three of the equations. The interpretation is 
that while the velocity of propagation depends on position in 
the field and direction of propagation (a fact we are already 
familiar with) it is also independent of the orientation of the 
plane of polarisation if it is a plane wave. This is a negative 
conclusion no doubt. Were it not true, a plane wave with its 
plane of polarisation oblique to the lines of the gravitational 
field should, if propagated in a direction at right angles to the 
field, gradually become elliptically-polarised owing to the un- 
equal velocities of propagation of the components of the light 
vector parallel and perpendicular to the field. A careful search 
for such an effect was made without result, thus justifying the 
conclusion drawn from the equations. Apparently the experi- 
ment was extremely precise, for had the speeds of the two 
components of the light vector differed by 10 ~ !* of the standard 
speed, i.e., by -03 cm. per sec., the ellipticity produced in a 
path of 40 metres across the Earth’s field could have been 
detected. 

Returning now to the general argument, we see that the 
equations of the electromagnetic field can be thrown into 
tensor form and so brought under general Relativity. To do 
so we make use of the covariant field six-vector I’,,, the con- 
travariant field vector FP, and the two reciprocal covariant 
and contravariant field vectors R,, and R. With this 
material a variety of forms are available. 

Thus (3) can be written 


Gee) iam ge oe , (B) 
which is equivalent to 


(PM)p= pf. es (3B) 
and (4) can be expressed as 
g—10(gRA*%)/dx. = 0 ; ; . (4A) 


pO (RAs), = 0 ‘ : «, - (4B) 
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Owing to the fact that we can derive a six-vectorial Curl 
from a covariant four-vector, it is possible to form a vector 
potential for the electromagnetic field which is a covariant 
four-vector. 

In fact, since divi Di=.0; 


we can write b = curl a, 


where a is a three-dimensional vector potential. 

Write A, = a,, Ag = ay, A; = @,, Ay = — $ whete ¢ is the 
scalar potential of the field. Then A, is a covariant four-vector ; 
for assuming it to be so, its Curl ought to be a six-vector, and 
this is so since 

dA ,/d%3 — dA,/d%_ = dA,/dz2 — d4,/dy 


x 


F os 
daz|dt + d6/dx 
— Cx 


= Fy, 


eo tt Wl tl 


0A 1/0%4 a 0A 4/ 0X4 


and in general 
dA y/d%u = dA, /d%) = Pye: 


ELECTROMAGNETIC FORCE-ACTIVITY VECTOR. 


If we follow the development of the theory in the same 
manner as in Chapter VI., we consider the vector which is the 
inner product of J* and Fa,. It is, of course, a covariant 
vector, and we shall denote it by K,: 


Ky = Fy, Jt. 


In terms of the field inductions and intensities, current 
density and charge density, we have 


K, = — 47 pléx + vb; — v,by) 
K, Sy tp(ey a Udy = Vxb;) (12) 
Kz = — g71p(ez + vxby — vybz) ; ; 
Ky = q7~'p(vxez + yey + v.62) 
or if we write k for the three-vector p(e + [Vv . b]), 
Ki 9" the 
K,=—q7-tk, 
Kg = — q~ tk, 
Ky =¢-(ve_k)e 


* The difference between the sign occurring here and that in 
Chapter VI. is due to the change in the sign of the square of an element 
of separation. 
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According to the usual interpretation, k is the electro- 
magnetic force on charged matter per unit volume. Supposing 
a particle of volume 6r is charged, the electromagnetic force on 
it is kér, or the electromagnetic force-components and the 
activity are 

— qK,6r, — qK 57, — 9K,5r, 7K ,8r. 


Now q6x,6x,8x,5x, is invariant, and therefore gP5r is 
invariant. Hence if the force on an electrified particle due to 
the electromagnetic field is f, we see that Bf,, Bfy, Bfz, — B(v . f) 
are the components of a covariant four-vector. Call it Py. 
Now let us endeavour to construct a possible equation of 
motion of such a particle in the gravitational field while under 
electromagnetic force. Let the volume of the particle be 
57, and py be its mass density when at rest in Galilean axes. 
Then an equation such as 


BooTo(%a + {aB, A}xax%~e) = P) 
is impossible, since the left-hand side is contravariant and the 
right-hand side covariant. But 
HodT0(%, + {aB, A}%a%a) = — BP 
is a possible equation. 


Now 
d/ds = dx,]ds . d/dx, 
== Ba/at. 


Therefore the equations become 


d(193ro Bax /dt) |dt + p87 o8b1(Vx, Vy, V2) 
(Uo TP Mey )/ pein o§ ef, a gist, Be g4(v ; f), etc. 


Now 
gpdr = St 
it follows that Bu0T) = VB? y9T 
=— por 
if be = IP °p9. 


If now p represents the mass density of the particle in 
motion in the gravitational frame, the mass of the particle m 
is related with mg, its rest mass in Galilean axes, by 


m = pdr = B87 


= Bp0®To 
= Bm, 


and the equations of motion and activity become 
d(mv,) [dt + mby (Vz, Vy, Uz) = — 8fx — BM fy — BM fe + guy!) 


and two similar equations, and 
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dm|dt + mg 4(Vz, Vy, Uz) = — Bttfs — Bly — Bz + s(v.f) 


where ¢,, . . ., 6, are four quadratic functions of the velocity 
components whose coefficients are the Christoffel symbols. 
These equations pass the Relativity test and degenerate to the 
form arrived at in Chapter VI. for a frame free from gravita- 
tion, viz., 
a(mv,)/dt = f, 
dm{dt = (v . f). 
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The next step is to express K, as the generalised Lorentzian 
of a tensor of the second order. Being guided by the earlier 
theory in Chapter VI., let us work out the components of the 
mixed tensor 


£(Pryrgl et — Ryagke), 
which we will denote by Ey": 


Ey? = q7~W(exdy + hyby) 

Ey = q7+(egdz + hzb,) 

E\* = — q7*(d,b, — d,by) 

Et = q— leyd, + hyby) 

Ey? = q~ *(Cydy — $3 (Cxdx) + hyby — $2 (hxbx) iS 


E 4) = q7i(eyh, — ezhy) 
1. ra jee 143 exdz “fe $>h,,) J 


Now let us write 


ty, for (hx = $>(é,4x) + hb, — $3 (hzb,)) 
tey for é,dy + h,by 


gx for d,b, — d,by 
Sy for ey, — e,hty 


« for 1(Send, + Sh). 
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4 Ina frame of reference free from gravitation we know that 
the following equations of momentum and energy-transfer in 
an electromagnetic field are true: 


Yxx/ DX + Ibzy/W + Ikxe/dzZ — Igz/3t = Ry 


dSz/d% + dIsy/Iy + ds,/d2 + de/dt a ie (v _ k), 
for in that case g = —1,d=0, b=h. 
These are 


aE lox, = = Ky = — Fy, Je. 


Eyeit follows from the Principle of Relativity that the equations 
In a gravitational field are 


(Ey*)o == ea fn . * d (15) 


since both sides are covariant vectors, and each side de- 
generates into the corresponding side of the equations above, 
which are true in a special frame. 

Using the result (12) of Chapter IX., we can write (15) 
rather more explicitly as 


9 30(qE)*)/d%_ — {Aa, B}Eg* = — Fyaj*; 
or, using the values written above in (12) and (14), 


Mtxe/dX + Mxy/Iv + dty,/d2 — dg,/d¢ = ky, + {Ia, B}Ep* 
and two similar equations fe 


dSz/d% + dS,/2y + ds,/32 + de/d9t = — (V.K) + (4a, B}EQ* 


Interpreting, as usual, t,z, ty, etc., as components of a 
stress in an ethereal medium, g as momentum-density, 8 as the 
Poynting vector or energy flow per unit time per unit area, 
and ¢ as energy density, we can then interpret 


— {ta, p}Eg*, — {2a, B}Ep*, — {30, B}Ep* 


as components of the gravitational] force exerted on the electro- 
magnetic energy, or as components of the transfer of momentum 
per unit volume to the electromagnetic field from the gravita- 
tional field in one unit of time ; {4a, 8} as the transfer of energy 
per unit volume from the gravitational field to the electro- 
magnetic field in unit time. 

Alternative expressions for these transfers are + [Aa, B]E%° 
(which = + $E*2gag/dx,), or + FEB dg8/dx,, i 
| 15 
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where EA == prak 
and ayy = uel. 
If the space axes happen to be those which are “ principal ”’ 


for the linear relations between d and e and b and h (see equa- 
tions (g) and (10)), then 


Sz = q~ 4(€yhz — ezhy) 
(Vy 273) — (dybz — azby) 


ll 


CV Ex 

2 
Sy C2 Sy 
Sz = C3°8z 


where ¢,, Cs, Cs are the velocities of light along the axes (see 
equation (I1)). 

These are the customary relations between the energy- 
stream vector and the momentum-density vector. 

It is of interest to express E)“ in terms of the two field 
six-vectors Fy), and f+. This is done as follows : 


Bag ht® = EO gs Page EAs 
Beet Se eee 
and hence F,,f1* — R,,R™ = 2F,,F' — 1FygF 8, 


Also, TO a NY sh, COU a ee 
= PME FSP |. 
Therefore fo Ree = oh 
and so on. 
Hence it is easy to see that 
LON UR hater ea T a (a 


remembering the mixed fundamental tensor g,* is equal to 
unity if A= p, and to zero if A + yp. 


CHAPTER XAT. 
GRAVITATION AND MATTER. 


THE DYNAMICS OF CONTINUOUS MEDIA IN A GRAVITATIONAL 
FIELD. 


REMEMBERING the sequence of steps in the development of the 
theory for homaloidal space-time, we naturally expect that the 
equation (15) of the previous chapter will give us a clue to the 
form taken by the equations of momentum and energy transfer 
in general. This expectation is quite justified, as we shall see ; 
but at the moment we must deal with a question which did 
not trouble us in the earlier development. We know that in 
Newtonian theory ‘“‘ mass’ plays a double réle. There is, in 
the first place, “inertial’’ mass, the factor which is associated 
with velocity in the expression for momentum. As a concept 
it has no direct relation to gravitation, and is defined so as to 
be free from any confusion with “ weight.” It is with regard 
to inertial mass that conservation is assumed to hold in the 
first instance. But there is also “ gravitational’ mass whose 
measure appears in the numerator of the expression which 
measures the gravitational force between two bodies, the square 
of their distance apart being the denominator. It is notorious 
that despite the exhortations of academic literature on the 
subject, the distinction between the two meanings of the word 
is a shadowy one to the majority of elementary students, the 
main reason for this being that in the theory the measures of 
the two quantities for any body bear to each other a ratio 
which is independent of the nature, size, and position of the 
body—a postulate which has been subjected to very searching 
tests—notably by Eotvés—with confirmatory results. But 
more advanced students are aware that in the treatment of the 
gravitational field within matter, the equation of Laplace, 


A¢ = 0, 
gives way to the equation of Poisson, 
Ad = anc, 
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where x is the density of the gravitational mass of the matter. 
Now, as we have seen, Einstein’s law of gravitation outside 
matter, 


degenerates in a first approximation into the equation of Laplace. 
It is at once suggested that within matter the potentials must 
satisfy some other equation, possibly one in which some co-vari- 
ant tensor replaces the zero of the right-hand side, and that this 
tensor should be calculable in terms of some quantity or quan- 
tities measuring the gravitational influence of the matter ; and 
if this be so, there arises the question as to the possibility of a 
close relationship between these quantities and the quantities 
such as momentum and energy which are covered by the 
conservation law. 

Bearing in mind, therefore, the possible need for a generalisa- 
tion of Einstein’s law of gravitation, let us proceed to deal with 
the covariance of the equations of momentum and energy 
transfer in a region where the potentials have general values. 

Experience so far suggests that a suitable form for the 
equations will be possible if a tensor of the mixed type 7)" 
can be constructed out of such material as the density of energy 
or inertial mass, and the density of momentum. In fact, in 
Chapter VII. we have the degenerate form for such a tensor, 
and examination of this form shows that the 44-component 
is e«, the energy-density, and if the element of volume sur- 
rounding a given point is at rest in a Galilean frame and the 
matter unstrained, all the components would still further 
degenerate to zero, except the 44-component which would be- 
come €9, the vest energy- (or mass-) density. Now it is not 
difficult to construct a tensor in any frame which degenerates 
to €, under like circumstances. (In the first instance the tensor 
is contravariant, but we can then obtain a mixed tensor by 
association.) Suppose m, is the rest-mass of an infinitesimal 
particle. Then 

mdx/ds . dx,/ds 


is a contravariant tensor, since my is an invariant quantity. 
The sum of such terms is also a tensor, i.e., 3 (mdx,/ds . dx,/ds) 
where the summation extends over all the particles within a 
certain element of proper volume 87,4. Now these particles will 
each have a world-line in the space-time. Let us choose points 
on these world-lines corresponding to a given value of s; these 
points will all lie with a certain four-dimensional element of 
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volume, and by a natural extension of two and three-dimen- 
sional geometry, this element will have a centroid %,, %», %3, X4, 
so that if +, = x, + &,, etc., then 


DME, = Lg EF, = LM Fk; = WMoE4, = O 
and therefore 
S(modx/ds .dx,/ds) = (Ymy)dxy/ds . dx,/ds + SmoedE,/ds . dE, /ds. 
The first term on the right-hand side is equal to 
BU 9d7 94x) /dXy . dk, /dx4 


where jy is the average density of the rest-mass in the element 
of proper volume 879. On writing w = f?quo, and dropping 
out the invariant factor 57), these terms become 


ee Oe Oe Og, CCC. goa ian Gs 
We take w as a measure of the density of mass or energy 


in the gravitational frame, which is greater than the rest-mass 
by the kinetic energy of the element as a whole, for 


BST = P*gp dt = BurgdTo. 


Then pv,, pv, wv, are components of momentum-density or 
energy-stream arising from the motion of the element as a 
whole. 

Treating the second term in the same way, we see that 
when A and p are equal to I, 2, or 3, the terms are the components 
of a three-dimensional stress-tensor (of the pressure type) 
multiplied by g-1. Let us denote this tensor by py,, pry, etc. 
The 14, 24, 34, 41, 42, 43-components are additions to the 
momentum-density and energy-stream arising from the internal 
motions of the element, and the 44-component is additional 
energy-density due to the internal motions (thermal energy) 
and the existence of pressure within the element of volume. 
These additional terms are also affected by the factor g—}. 
Considering p to represent the total energy- (or inertia] mass-) 
density, g to represent momentum-density, we see that this 
contravariant and symmetric tensor is T*“ where 


b= u=2 u = 3 Lene 


a8 Se = Pax + Sxlx Dyn + &yVx Dax + Be¥x | Bx [ast 
Pay + &xdy | Py + 8y%y | Pyy + 8x¥y | By | aA=2 (x) 
Paz + &xd2 | Pyz + Bytz Pur t+ 2e0s | Bs |a=3 
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(The bar is dropped as no longer necessary.) 
The associated mixed tensor is 


HRS 3 AN ee : : : Bes 
and there is an associated covariant tensor 
4 Fg 9 PB ee . . <, 12) 
In the restricted theory the degenerate form of 7" is 
— (Dux + &x0x), — (Buy + Bey), — (Pez + &x%:), — & 
two similar sets of expressions, and 
a Sy & ma 


and the equations of transference of momentum and energy to 
the material system, which are, in this case, 


Wx/d% + IDay/My + IPz2/d2 + Dg,/dt = px, 
two similar equations, and 
d8z/0% + dSy/dy + ds,/0% = (V . p) 


where p is the force excluding gravitational force exerted on 
the matter per unit of volume, can be written 


RW BN ee — i Py 


where P) is the vector Bp,, Bb,, Bbz, — B(V . p). 
This equation is clearly the degenerate form of 


Ties = Pyle i a eee 


which is consistent with General Relativity, and so can be 
taken to be the equation expressing the general laws of 
mechanics. Alternative forms of (4) are seen by a glance at 
equations (12) of Chapter [X. to be 


q~70(9T4")/d4%q = fra, B}Te*— Py, . (4a) 
q~ *0(qTy* Vio he eat a 

= 3TPdgop/ox,— Py _. (4B) 
q~ *0(QLy*)/2%a = — ZT apdg*8/0x, — Py, —. (4C) 


We can write equations (4) more explicitly if we introduce 
these symbols : 
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Wee = &1rP xx + BroPyx + Lishex 
Tey = S11Pzy + Biebyy + BisPey 


Tye = SarPex + LooPyx + SosP ex 


Wee = &siPxz + SaoPyz + kasPez 
Ox = SarPxx + SaoPyx + asPex 


Oz = BarPez + SaePyz + LasPar 
Vx = 8118% + 8128y + £1382 + £14 


Ys = 838% 1 Sae8y + &s38z + Saal 
v = 8ai8x + Say + 84382 + Laat 
On referring to (4) and remembering that g, = s, = py, 
we find that they can be written in the form 


Q~ (dt zx/d% + Itrzy/ MV + Itryz,/02 + Dy,/dé) 
=}T*82g05/0% — Bp; | 
two similar equations, and ~ (4a 
g—*(d0,/d% + doy/dIY + da,/d2 + Dv/d#) | 
= 2T®dgap/d¢ + B(V - P) 


As we have already pointed out, “ gravitational force ”’ does 
not exist as a fundamental concept in the Relativity Theory 
of Gravitation, but equations (5) show that if we wished to 
retain the term for convenience of expression, then the com- 
ponents of gravitational force on matter per unit of volume 
would be measured by 


* As usual, the dimensions of the various terms in these equations 
are rather concealed by the use of the Relativity unit of time. If we 
wished to introduce the second as the unit of time and express » in 
ergs perc.c., g in grammems per C.c., and p;,, etc., in dynes per sq. cm., 
we would have to alter the quantities g,,, Zo;, g3, to numbers c times 
larger, and g,, to a number c? times larger. In (5) we would then have 
to write ./c for » in the last term of the expressions for y,, yy, yz, SO 
that this last term would be obviously a momentum-density in grams 
perc.c. In the expression for »y we would write g,/c, etc., for g,, etc., 
and p/c? for », so that » would be a density in grams perc.c. But 
actually the dimensions can be shown to be correct in the expressions 
as written above, if we remember that the velocity of light along the 
axis of x, for example, is a root of the equation 


&,,(4%/dt)? = 28,,4%/dt + 8, = 9, 


so that g,,/g,, is the square of a velocity; and g,,/g,, is a velocity, as are 
E 823/81 and 834/811 also. 
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— $B -1TBdgag/dx, etc. . : . 6) 
and its activity would be measured by 
$B —1T28dgq8/dt.  . : : . (6A) 


If the force density p which occurs in the equations just 
dealt with is due to the action of the electromagnetic field on 
matter, then by reference to Chapter X. we see that 


Le tea 
and in consequence 
(Ty*)a = Pra J? 
=e (Ey*)a 
or (Ty")a + (Ea%)a = 9, 


‘ 


which, after all, is consistent with the “‘ material’ nature of 
all energy implicit in the Relativity Theory, and implies that 
the complete ‘“‘matter’’ tensor should embrace the energy, 
momentum, and stress of the electromagnetic field just as much 
as the concentrated energy, etc., existing in the minute regions 
which we term electrons and nuclei. If we desire to be as 
precise as possible, the electromagnetic field would be defined 
as that part of the space frame where the e and h vectors 


satisfy 
(FFA), we) 
(RA*), = 0 (A) 
and the gravitational field as that part where the g,,-components 
of the fundamental tensor satisfy 


Guo. GS ne 


‘“‘Matter,”’ then, must be that part of the frame where neither 
(A) nor (B) are true. Instead of (A), we know that in matter 


(Fs), = p 
(Rie NO 


may be true where J* is defined in terms of the quantity p, 
called density of charge. There are two ways of treating this : 
either by regarding these parts as of no finite magnitude, but 
as “singularities in the field’”’ where p is infinite, and not 
concerning ourselves with them at all; or by giving finite 
magnitude to them, “ smoothing out” the grained structure 
and attributing a finite value of p to each point of it. In the 
same way we might treat matter as a region throughout which 
(B) is true except at certain point singularities having definite 
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“strengths,” these strengths being constants introduced in 
solving for the potentials from (B). We are aware of such 
treatment in Newtonian theory where matter is regarded as a 
congeries of points having strengths M,, M,,... My, and 
the potential at all points except these is given by 


3M, /7;. 
t=1 


But just as we can treat Newtonian theory by “‘ smoothing 
out " these singularities over a finite region, and then introduc- 
ing Poisson’s equation instead of Laplace’s, so this method of 
treatment can be adopted here, and the question arises—What 
is the analogue of Poisson’s equation ? 
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The answer which has been given by Einstein to this question 
is obtained by a rather lengthy and involved development of 
the tensor analysis. The beginner, as he makes his way 
through the details, is apt to lose the general thread of the 
argument and forget the main purpose in view, while he is intent 
on verifying the mathematical truth of each step. So it seems 
wiser to set out broadly the trend of the argument, filling in 
the mathematical details later. 

The first step consists in the introduction of the function 


grtya, B}{eB,a}- - - (7) 


which we will denote by the symbol y. It is obviously homo- 
geneous and quadratic in the quantities dga,/dx,. But by 
equation (2) of Chapter IX. it appears that it can be transformed 
into a function homogeneous and quadratic in the quantities 
dgr#/dx,. Also, since each ga, is a homogeneous function of 
degree — I in the g™, it is easy to show that coefficients of the 
squares and products of the dg*#/d%, in y are themselves homo- 
geneous functions of the g’ of degree — 1. In what follows 
we shall use the symbol g,’ to represent g*#/d%,. (It is not a 
tensor, and must not be confused with the covariant derivative 
(g#),, which can be shown to be identically zero by equation (5) 
of Chapter IX.) Thus y can be regarded as a function of fifty 
independent variables, viz., the forty g,* and the ten g* ; and 
by Euler’s theorem on homogeneous functions 


SaPVx/ BaP = 2X \ (8) 
g8roy/dg87 = — x. : , 2 
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On giving arbitrary variations to these variables, it appears 
that A, 
dy/dgyr = — {Ap, \ 
ye = — Da, PuBahf 


In the second step, Einstein introduces the sixteen quantities 


Ere — BrbrPoy/Mwuby —. - (20) 


which he denotes by at)“. The ¢,# do not constitute a tensor, 
but, as will appear in the sequel, they enter into covariant 
equations in a manner similar to tensors, provided a certain 
restriction is imposed on the co-ordinates. (The multiplier a 
is a numerical factor which is determined later.) For this 
reason we do not use the usual capital letter, and we can refer 
to them as components of a ‘‘ quasi-tensor ”’ of the mixed type. 
We have also a quasi-tensor of the covariant type, Zuata%, which 
we will denote by t,,. If we write ¢ for the contracted quasi- 
tensor f,%, we have 


t= 4Y — 2V = 2x. : ; Pa eee 

It can be proved that 
a(t" — Zgyrt) = gyBr{By, w} - a (12) 
and A(tru — B8aut) = &°7[By, w]. =. aS) 


Three equations are now proved involving these quasi- 
tensors and also the function ¢,, defined in equation (18) of 
Chapter IX., as 


— dfAu, a}/dxq + fAa, B{uB, a}. 


This function is also a quasi-tensor (which acquires full 
tensor properties in a system of co-ordinates for which g = — 1), 
and we can derive from it the associated mixed quasi-tensor 
glib, or d,“. The three equations are 


Suy2(g87{AB, a | [d%q = ZA(tay — B8aut) — Gru» (T4) 

d(gHPAB, a})/d%q = Zaltyt — FEy"t) — Pat (15) 

and evrrbay = — Adt*/dI%Xq. . : (20) 
The last step in the analysis produces the equation 


£0°(g%80 log q/d%p)/0%)d%q 
= $A0bj2/9%q — (d6)%/d%q_ — d4/0%,), (17) 
where ¢ = gBdag. 


On this mathematical material Einstein bases the following 
train of thought. 
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Supposing we restrict our co-ordinate system to be such 


that g = — I everywhere, then the law of gravitation outside 

matter becomes ¢,, = 0, and so outside matter in a system of 
reference for which g = — i the following equations are true: 

Suyd(SPMAB, a})/0%a = 34(tau — H8rut) =». (14") 

O(SHAIAB, a})/3%q = fate — feyt) . .  (15') 

d¢)2/d%_ = O. } ‘ : rs) 


Equation (16’) is in the form of a conservation law. It is 
true that 7) is not a tensor, yet the equation (16’) is covariant 
under the condition g = — I; i.e., if we transform to accented 
co-ordinates for which g’ = — I, then 2¢’)*/d%,’ = 0; but not 
so if g’+ —1. Just as 7)“ was a stress-momentum-energy 
tensor for matter, so we can call this quasi-tensor the stress- 
momentum-energy quasi-tensor of the gravitational field; and 
in this field, so long as we deal with an extension in the space- 
time continuum, which does not contain the world-lines of 
matter and which is so large that at its boundaries the gy, 
take their constant Galilean values (and in consequence the 
t,“ vanish), then there is conservation of (¢,4, ¢,4, 734, ¢,*), pro- 
vided, of course, that our co-ordinates are subject to the 
condition g = — I. 

It would seem natural, therefore, when dealing with portions 
of space-time within the boundaries of material world-tubes, 
to introduce the matter-tensors as well, and replace 4,“ by 
ti, + T4, and baw by bry -- Eves 

This would involve, as a conservation law within matter, 
that the potentials should satisfy 


(ty? + T)%)/d%. =O . é a0) 

and (14’) and (15’) should be replaced by 
Luyd(SPYAB, a})/D%q = $4({tau + Tru — d8audé + T}) . (29) 
d(gHBIAB, a})/2%q = $a({ey" + TH} — 28r4te + TH), . (20) 


where TJ is the invariant quantity 7,*, and we are still imposing 


the condition g = — I. 
Now it is only necessary to assume that, within matter and 
subject to the condition g = — I, either (19) or (20) is true, 


for the other one will follow by an inner multiplication ; 
equation (18) can also be deduced from either. Assuming, 
therefore, that (19) is true within matter, we proceed to modify 
it somewhat with the help of (15), for, of course, (15) is true 
anywhere. It is easily seen that from (19) and (15) we obtain 


Pru = — $4(Tap — $eaul) . : oll 2% 
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and so Einstein assumes that this is the general law of gravita- 
tion for a co-ordinate system in which g = —1I. To give ita 
general covariant form is not difficult, for we recall that ¢a, 
is the degenerate form of Ga, in a system subjected to the 
restrictive condition g = — 1. Hence Einstein’s complete law 
of gravitation for any frame is 


Gay = — $4(Tap — 28rn7), 


which is unconditionally covariant, since it is a linear relation 
between covariant tensors. The value of a can now be easily 
obtained by an appeal to the approximate Newtonian theory. 
Just as in a first approximation G,, = 0, degenerated into 


3 
$2078 44/0%n? = 0, 
n=1 


so Gag = — 34(L as — 38 aT) 
degenerates into 
3 
2 20°84) d4n” = — 44fo; 


for all the components of 7)“ and T), vanish except T,* and 
T4,, these being py in each case, and so is T also equal to po. 
Moreover, g4, is in the limit equal to 1 — 2%, where ¢& is the 
Newtonian potential, and thus 


Ads = 4aUL9. 
But this must be Poisson’s equation ; hence 
@—=A0nk, 


where « is the Newtonian potential due to a concentrated erg 
of matter at a distance of 1 centimetre. In other words, «x is 
the astronomical constant 6°7 x 10~8 divided by the square 
OL. C019 xX I0"* 371.6. 

k= 7'4 X Io ® 
and a= 374 X Io7*, 


It is in this way that Einstein arrives at his general law of 
gravitation : 
Gy. = — 8irk(T ry — $2,,7) : (22) 
As regards the standpoint of Relativity, this law is, of course, 
satisfactory. As regards “‘ truth to nature,” it becomes 


Gi 0 
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outside matter, for there T,, = 0, and the consequences of this 
will be discussed in Chapter XIII. In a first approximation 
we obtain the Laplace-Poisson equation of Newtonian theory. 

It will be as well to repeat the reasoning in a somewhat 
different fashion, which will lead us to a conclusion of extreme 
importance. 

Equations (14)-(17) represent pure mathematical relation- 
ships between the various ¢ and ¢ functions, independent of any 
law of gravitation. They are, in fact, in the nature of necessary 
conditions which no law of gravitation can logically violate. 
Suppose the law of gravitation is 


Gay = — $4(Tap — 38ayT), 


where T,, is the matter covariant tensor defined earlier in 
the chapter, and let us limit the co-ordinates so as to satisfy 


the condition g = — 1; then we have 
34(D nu — $8rul) = — fru 
An inner multiplication yields 
2a(T* — 48yT) = — $y, 
and by contraction 
te 
so that 
gaT y* = — (py" — $849). 
Hence 
4a0T \*/d%q = — $adly4/d%q 


by (17), remembering that 2d log g/dx, = 0 since g is constant. 


Therefore 
d(ty% + T *)/d%q = 0. 


This is (18), and the two just written above are easily seen 
to be (19) and (20), by reference to (14) and (15), thus justifying 
the statement that if (19) is assumed to be true as a law of 
gravitation, (18) can be deduced from it. Equation (18), 
moreover, is by its form a law of conservation ; it should cover 
the conservative transformation of energy and momentum 
between matter and the gravitational field. This can be seen 
still more clearly by eliminating the quasi-tensor ¢,“ between 
(18) and (16), when we obtain 


AdT y*/d%—q = EPpy. 
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Now referring to (21) and remembering that gg,dg47/dx, = 0, 
since g, or geyg87, is constant, we obtain 


dT %/d3%q = — $7 pydg7/d%), 


which is just the equation (4) for this restricted choice of 
co-ordinates. It therefore appears that Einstein’s law of 
gravitation contains the principle of conservation within it so 
long as this restriction holds. It is natural to inquire if this 
restriction is necessary. Supposing, therefore, the co-ordinates 
are quite general, so that Einstein’s complete equation 


Grau = — $4(Tay — 28anT) 
is used. By inner multiplication we obtain 
Gye = BG ig = Be Ee eee 
and by contraction 


GG Stee eee 
Hence 
241)" = Gy4 — FgyKG. 


Now we have seen earlier in the chapter that the uncon- 
ditionally covariant law of transfer between matter and the 
field has the form 

(La =O; 


Hence if this is to be consistent with the law of gravitation 
as suggested by Einstein, it follows that 


(G)*), is half the contracted derivative of g\"G 
or (Gq = $0G/0%y. ; : ; : s (24) 


Now, as a matter of fact, equations (24) are true. But it 
will, as before, be wiser to postpone the proof of these identical 
relations until later. It has already been stated that although 
there are apparently ten equations of gravitation, only six are 
independent, since there exist four relations between the Ga, 
components. Equations (24) are the four relations in question. 

Having explained how the structure was raised, it may not 
be unprofitable to remove the scaffolding and view the result. 

As a foundation we have the identical relations, which are 
independent of any law of gravitation, viz., 


(G\*), = $0G/dx). 
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Einstein suggests as the complete law of gravitation, con- 
necting the potentials at a point with the distribution of material 
stress, momentum, and energy immediately around this point. 


Gay =>— 8ak(Try = $2aul ) 
or its equivalent : 
Gay — 48auG = — 8aeTray .- 5 A224) 


From this and the above relations we derive the ‘“‘ laws of 
mechanics ’”’: 
ears = 0. 


In short, Einstein’s law is not merely a law of gravitation ; 
it is a general dynamical principle. In classical dynamics 
gravitation stands outside the general system of equations 
(Lagrange’s or Hamilton’s), which are entirely independent of 
any law of gravitation. Only in the identification of gravita- 
tional and inertial mass is there any point of contact. Inthe 
dynamics of General Relativity the fusion of dynamics and 
gravitation is complete, and, incidentally, the identification of 
gravitational and inertial mass is a restricted aspect of the fact 
that in the calculation of the Einstein tensor G),, the inertial 
mass (energy) and momentum are involved in the tensor Ty. 

We shall now deal with the proof of equations (8)-(16), upon 
which Einstein’s reasoning is based. Before proceeding, the 
reader should refer for a moment to the equations (2)-(8) of 
Chapter IX. as they will be frequently employed in what 
follows. If we vary the independent variables in x by dg’ 
and 6g, we obtain 


8x = {ya, BEB, a}dgr + gr({ya, Bid{eB, a} + eB, a}8tya, B}). 
The second group of terms on the right-hand side 


= 2g7'{ya, B}B{eB, a} 
= alya, PIS(er{eB, a}) — afya, BYeB, a}8e% 


Hence 
8x = — fya, BiB, a}5er + 2fya, B}8(g7{eB, a}). 
But ; 
2g7{eB, a} oma! 812° ILen/d%B aE en] Xe ae Wpe/d%n). 


In the summation of the previous line it will be found that 
cancellation of terms gives 
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by = — ya, BYeB, a}8gr + {ya, B}8 (729 08en/ 22) 
= — fy, BYeB, a}dgre — {ya, B}3(0g7*/2%8) 
= — fya, Bi eB, a}dgv — {ya, yoga. 
Hence results (9) follow. 
Applying (9) to the definition (10), we have 


aty# = grey + gyBr{By, pu} 


and by (8) 
at. = Lax aa LaPVDK/ dB a 
== AN AX 
= 2y. 
Thus, 


a(t — gett) = gyPr{By, wr}, 
which is (12). 
Putting » equal to a, and effecting an inner multiplication 
by gua, we obtain 
Atru — 28rut) = BrP yalBy, a} 


eee = gP7[By, 1] 
which is (13). 


In obtaining (14) and (15) we start from 
(Au, a} = guF{AB, a} = SuygO4{AB, a} 
and differentiate with respect to the co-ordinates so that 
(ys, a}/d%e = Buyd(GEr{AB, a})/d%a + BM{AB, a} 2€uq/ De 
The second term on the right-hand side is equal to 
— IAB, a}g9% 2 8und8°"/ d%a 
= — (AB, a}8undg82/d%q 
= UB, a}gun(g8{ya, n} + g{ya, B}) 
= s'rAB, a}lay, wu] + {AB, a}{ua, B} 
= 2(87P{BA, a} + ge4{BA, y})[ay, w] + (AB, a}{ua, B} 
= — 28xay, w] + {AB, a}{ua, By. 
Hence 
~ $e = Buy>(GONAB, a})/0%q — SerrTBy nw). (A) 


Putting 4 equal to « and employing inner multiplication by 
ge, we obtain 


— BEBre = d(gHALAB, a})/D4%a — dB ,°7{By, wh (BY 
Equations (12) and (A) give (14). 
~ (73) 5, -(B) S99Gs). 
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We arrive at the third of this group of equations, viz. (x6), 

by writing (12) as 

aty* = gy8r{By, a} + By, 
differentiating with respect to x, and observing that dg)47/dx 
is the same as dg,87/dx). : 

Thus 

Adt\*/d%q = 0(8,97{By, a})/d%q + dy/dXy 
= 0(g,P (By, a})/d%a + dy/dBR7 . dg7/dx) 
+ dy/dgaPY . MgF7/d%) 
= 0(g,1{By, a})/d%_ — {Be, n}yn, gay 
mag {By, a}dg,F7/dx) 
97(d{By, a}/d¥%a — {Be, n}{yn, ¢}) 
= — g)/7bzy. 

Equation (17) is the result of performing the following 
operations on (I5). Contracting it, multiplying by dg», 
and subtracting from (15), we have 

d(gHP{AB, a})/d¥%_ — 4eaXg*r{By, a} 


= fat," — (fy* — heyrd) . : ste) 
Now 
gHBLAB, a} = ZerBgr<(den</d%~ + Iwpe/dIX, — ?ap/d%e) 
= $grBoredgn./d%) 
= — g0gee/d% 
and 


EP Py, a} = HgF7g%*(dIepe/d%-y + Wre/d%B — 2 py/d%e) 
= gPIg2* dg R6/I%y — $8291 py/ dX 
= — 9g%7/dx_ — $%d log g/dx.. 


Introduce these expressions into the left-hand side of (C), 
put » equal to 7, operate with 0/d%,, and (17) is obtained. 

With regard to the very important equations (24), whose 
proof has also been deferred, it is obvious that if written out 
im extenso the proof would appear to be very complicated and 
laborious indeed. In his ‘‘ Space, Time and Gravitation,” 
Eddington says that he doubts ‘‘ whether anyone has performed 
the laborious task of verifying these identities by straightfor- 
ward algebra.’’ Nevertheless, in the mathematical supplement 
to the French translation of that work he gives an algebraic 
proof himself, which does not seem to be very involved or 
tedious. But the proof which follows is due to Jeffery, and 
appeared in the “ Phil. Mag.,” 43, pp. 600-603 (1922). It is 
quite brief and elegant. ; 

I 
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Referring to the ten equations (16) of Chapter IX., and the 
forty further equations to be obtained by differentiation with 
respect to %1, %2, %3, %4 respectively, making fifty equations in 
all, it is clear that we can obtain in an infinite number of ways 
values for the coefficients a), and their derivatives daa,/d*, 
(eighty quantities in all), which will allow us to prescribe the 
values of ga, and their first differential coefficients at one 
definite point-instant in the world. 

Selecting such a definite point-instant for the origin, trans- 
form the co-ordinates so that the first differential coefficients 
of the ga, vanish there. In general, second and higher deriva- 
tives do not vanish. 


The contracted covariant derivative of Ga, is 
(Gy%)q = IG)%/d%_ — {Aa, B}Gg* + {Ba, a}G)? 
which simply becomes at the origin 
IGy2/d%a 


which = 9(g"8Gyg)/ dx 
=e BOG g/ dX. 


Substituting the expression for G,, from Chapter IX., and 
omitting terms which vanish on differentiation, we obtain 


(Gy*)q = — BE B1D"(DBpe/ 0%, Tr rel 9% B — Wap] d%e) [Wg 
+ g%Bd8 log g/d%,d%_d%~g 
= — $e 76 d8gg./0%) IM dX + B28 log g/dx,dIX_dx%— (D) 
by cancellation of terms on summation. Again, 
9G/dx) = d(g**Gag)/d*) 


See BE7FR7d?(Iae/dXp i dRe/d%Xa 7 Wap] dX) /IXI%y 
+ gd8 log ¢/d%)d%qd%p. 


The first and second terms in the bracket are identical on 
summation. The third is 


5B R108 Qap/ 9% dX dX 
which, by modifying the dummy suffixes, 


= $e o7*982.,6/IX) Mg dB 
= $8°B02(g7dg4¢/d%q)/d%, IX p 
= g*898 log g/dx\d% dX p. 
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Hence ; 
IG/dxy = — grBg7<d8gg./9X)INqdINy + 2g78d8 log g/dx)dXqd%g. (E) 


Comparison of (D) and (E) shows that the relations (24) 
are identically satisfied at the origin in the chosen co-ordinates. 
As they are tensor relations, they are true at the origin in any 
system of co-ordinates. But the origin is any arbitrarily 
chosen point-instant. Hence they are true generally. 


16 


CHAPTER XII. 
ACTION. 


In the older mechanics the most general form which could be 
given to the differential equations of motion was first dis- 
covered by Lagrange, and a modification of the Lagrangian 
form was indicated later by Hamilton. Furthermore, it 
was discovered that dynamical theory could be succinctly 
expressed in a principle having the widest application. This 
principle could take one or other of two forms according 
as one or other of two conditions were imposed—the Principle 
of Least Action or Hamilton’s Principle. The belief gradually 
gained ground that as knowledge in Physical Science and 
Chemistry progressed, all natural phenomena would be found 
to be in the last resort mechanical phenomena occurring in the 
extremely complicated molecular and atomic systems which 
constitute matter, and so would come under this principle. 

The object of this chapter is, in the first instance, to deter- 
mine if this principle, generalised if necessary, still holds for 
Relativity dynamics and, in the second, to see how far physical 
phenomena are amenable to it when so generalised. 

As is well known, complex mechanical systems can be more 
conveniently described in terms of a number of generalised 
co-ordinates, such number being the number of the degrees of 
freedom of the system, rather than in terms of the Cartesian 
co-ordinates of its separate particles. We represent such 
generalised co-ordinates by the symbols q,, qo, 93, . - +» Qn- 
The rates at which these co-ordinates change we shall denote 
by 71, 7a. 1%: - « «. Mm They are, in fact, dgj/de, dg.idrwetcs 
the generalised velocities. They are usually indicated by the 
symbols g,, Jo, . . ., etc., in text-books of dynamics; but to 
avoid confusion we prefer to retain the dotted symbol to indi- 
cate differentiation with regard to separation or proper time 
rather than with regard to the special time of the frame of 
reference. 

The kinetic energy T of the system is a quadratic function 
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of the velocities 7,, 7:, etc., whose coefficients are in general 
functions of the co-ordinates q,, ¢2, etc. When the system 
receives an elementary displacement represented by 89, 84s, 
etc., its internal potential energy V will suffer a change arising 
from the work of the forces acting between its mutual parts, 
and at the same time work will be done on it by external forces 
involving a transference of energy to the system from its 
environment, which we represent by 


Q 1841 ate Qsdg2+ ... + Ondq, 


Q1, Qs, .- +, Qa being the generalised components of the 
external forces acting on the system. 

It is then demonstrable that the following equations of 
motion, due originally to Lagrange, are true: 


A(oT/d7,)/dt — oT /dq, = — V [241 + 1 


and  — I similar equations. 

By writing L for T — V, and bearing in mind that V depends 
only on the co-ordinates and not on the velocities, we can 
express these equations thus: 


d(dL/dr,)/dt — dL/29, = Ox, ete. 


L is usually called the Lagrangian function, and is the 
difference between the kinetic and potential energies of the 
system. 

There is another form for the general equations of motion 
due to Hamilton. In this the kinetic energy of the system is 
expressed in terms of the generalised components of momentum. 
These are the partial differential coefficients of the kinetic 


energy as expressed above with respect to 71, 73, ..., % 
respectively. Denoting the components of momentum by 
by po - - +» Pn, we have p, = dT/07,, Pp. = dT /2r,, etc. 
Since T is a quadratic function of 7;, 75, . . ., % it is easily 
seen that when expressed in terms of the components of 
momentum it is a quadratic function of f,, py, ..., Da, 
whose coefficients are in general functions of 91, 9s, . . ., Mu. 


We denote the kinetic energy when so written by the symbol 
T;. It transpires that Hamilton’s form for the equations of 
motion of the system are 


dT, (2p, = dg, [dt = 7, 


and » — 1 similar equations, 
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and oT,/09, = — 4p,/at — V/291+ 21 


and ” — 1 similar equations ; 
or, writing H for T, + V, we have 

0H /dp, = dq,/dt 
and m — 1 similar equations, 


dH /dq, = — ap, /dt + Qi, 


and ” — I similar equations. 

H is generally called the Hamiltonian function, and is 
equal to the sum of the kinetic and potential energies of the 
system expressed, of course, as a function of the co-ordinates 
and momenta. 

Suppose we follow a certain part of the history of such a 
system. Such a portion of its history is called a “ path;” 
the beginning of this path is indicated by a set of values for the 
gq, and 7;, or the gq; and ~;; similarly, the end or any other 
definite condition of the system at an instant of this history 
is defined by definite values of the same variables. Such sets 
of values succeed one another in time in accordance with the 
equations written above. But it is conceivable that by the 
introduction of adiabatic constraints (that is, constraints whose 
reactions would do no work on the system), the system might 
be made to describe an adjacent path from the same initial to 
the same final configuration as before removed from the first path 
by differences of co-ordinates and velocities of the first order of 
magnitude. In addition, the initial velocities in this varied 
path might be arranged so that the system would obey some 
further conditions ; for instance, it might describe the varied 
path in the same time, or it might describe it with the same 
total energy as before. 

Following the system in its actual path, we can obtain the 
integral /Ldt from the initial to the final configuration. Such an 
integral has the dimensions of a quantity which is the product 
of energy and time, and is referred to as Action in dynamical 
theory. We can also obtain the value of this integral for any 
assigned neighbouring path varied from the actual by means 
of adiabatic constraints which, of course, introduce no change 
into the form of the function V as expressed in terms of the 
co-ordinates. We proceed to state an important theorem 
concerning this variation of the action thus produced, under the 
condition that the initial velocities are so adjusted in the 
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varied path that it is. described, from the same initial to the 
same final condition, 7 the same time as the actual path. 

With this condition we can obviously set up a one-one 
correspondence between the states of the system in its actual 
history and those in its arbitrarily-varied history. A state in 
the actual path represented by the co-ordinates and velocities 
Ty Yar - - +» In» 1, 1a, . - -, Y, Will correspond to a state. on 
the varie’ path ¢, -+- 89,, . - -» Gn + Sdn, 73 + Ory, . . « 
7%, + 8%, Which is attained after the same lapse of time from 
the initial instant. The variation symbol 8 here refers not to 
an elementary displacement of the system along either the 
actual or varied path, but to a virtual displacement of the system 
from an assigned state on the actual path to an adjacent state 
on the varied path conditioned by an unvaried value of the 
time. Corresponding to every such assigned state we can 
calculate the work of the external forces on the system if it 
were actually to receive such a displacement. This virtual 
work, viz., 30;59;, we denote by 8W. To each state of the 
actual path would then correspond a definite value of 6W if a 
definite varied path is assigned, and this will also lead to an 
integral along the path, viz., /5Wdt. The theorem referred to 
and known as Hamilton’s Principle then asserts that 


S/Ldt+/sWdt=o0,  . ; (5) 


or the variation of the action together with the integral of the 
virtual work of the external forces is equal to zero. This 
theorem is obtainable from the Lagrangian equations of 
motion. On the other hand, if this theorem be taken as the 
fundamental principle of dynamical theory, the Lagrangian (or 
any other) form of the equations of motion can be deduced 
from it. 

It will now be our object to investigate what modifications, 
if any, must be introduced so that this theorem and its deduc- 
tions may be valid for the wider scope of Relativity dynamics, 
and, in the first instance, we shall restrict ourselves to the 
homaloidal world of the earlier theory. 

Referring to Chapters III. and VI., we see that the equations 
of motion of a particle in any reference frame are 


P, = d(mdx)/ds)/ds, 


where P) is the “‘ force-activity ” four-vector Bf, Bfy, Bf:,B(V - f). 
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In terms of the usual three-dimensional vectors the first three 
of these are summarised in 


f = d(mv)/dt, 


where, of course, m is not a constant but equal to m)(I — v?) —3, 
m, being the proper mass. Now let 6r represent a virtual 
displacement to a neighbouring position on a slightly varied 
path for the particle. Then 


(f.. or) = (2 ae) ee ér) 

= a(mv . 5r)/dt — (mv . dér/dt) 

d(mv . 6r)/di — (mv . ddr/dt) 
= d(mv . 6r)/dt — (mv . dv) 

L.€., (f . dr) + (mv . dv) = d(mv . sr)/dt. 


Hence, if we integrate this along the path of the particle, 
the integral of the left-hand side will be zero, for the integral 
of the right-hand side is the difference of the values of (mv . 8r) 
at the final and initial positions which are individually zero since 
6r is zero at the beginning and end of the varied path. 

Now the geometric product (Vv . 5v) is the ordinary product 
of the magnitudes of Vv and 8V, i.e., it is vdv, so 


JS (mv8o + (f . 8r)]dt = o. 


In the older mechanics mvdv being equal to mé(v?/2), and 
therefore also to 5(mv?/2), we should arrive at once at Hamil- 
ton’s Theorem for a single particle, which could then be 
easily extended to any mechanical system regarded as an 
aggregation of particles. But the variability of mass with 
velocity, which is a feature of Relativity mechanics, precludes 
such a step. However, the necessary modification is to hand, 
for it is not difficult to obtain a function whose differential 
coefficient with respect to v is equal to the momentum mv. It 
is, in fact, — m)(1 — v?)#, for 


d{— mo(I — v2)#}/dv = mgv(r — v2) -# 
= mv. 


Hence 
mvov + (f . dr) = 6K + (f Sar), 


where we write K for — m,(1 — v?)t, and so for a single particle 
/(8K + (£. dr)ldt = o. 


If we now consider a mechanical system consisting of an 
aggregate of particles which are acted on by external forces 


ACTION 249 


but which do not mutually act on one another, we can derive 
an equation of action for it by a simple summation of the 
equations for the individual particles. We represent S(f . 8r) 
by 5W, and we take the symbol K to mean 


=[— m (x — v)#] 
and so we arrive at 


8/Kdt+ /sWdt=o . a2) 
As a first approximation 
K = 43m,v? — =m, 


and so K approaches in value to a difference between a function 
4=m,v? the kinetic energy of the older mechanics, and =mp, 
which, according to the newer views, is just a quantity of 
energy, the internal energy of the individual particles, each 
one regarded as an isolated system. Of course, K is really 
equal to B—12m,(B — 1) — 2m, = B-12(m — my) — =m, and 
B-iz(m — my) is really a function somewhat less than the 
kinetic energy of Relativity mechanics. 

Apparently the next step in the extension of our views is 
to consider a system of particles in which interaction between 
the particles takes place as well as the action of external forces. 
But, if we do so, we shall be carrying into our new synthesis 
ideas from the older science which are really foreign to the 
Relativity standpoint. Action at a distance across intervening 
space between particles according to some law of distance will 
not fit into the laws of Relativity dynamics as we already 
know. So we cannot quietly absorb that part of =(f. 6r) 
which refers to “ internal’”’ forces into a term such as — dV, 
representing a decrease in some potential function. To be 
logical the step from an aggregate of “inert ’’ particles must 
be to a system for which, although energy and mass are still 
highly concentrated within certain small regions, which we 
may refer to as particles, yet some of the energy and mass is 
diffused with a small but finite density in the remaining portion 
of the space occupied by the system. This implies that K is in 
reality an integral such as /{}— po(I — v*)#}d79 (where py is now 
a proper density, dry an element of proper volume) rather than 
a sum of a finite number of terms. K would still be the differ- 
ence between a term depending on v and approximating to the 
kinetic energy, and a term /od7), which would represent 
intrinsic energy or mass localised mainly but not entirely in 
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certain minute regions; the “ diffused” part of the intrinsic 
energy would, in a sense, correspond to the mutual potential 
energy of the particles which arises quite naturally in the 
mathematical working out of the earlier views. 

In the older Physics great success attended the application 
of equation (1) to all kinds of physical systems in which the 
view that they were merely aggregates of particles mutually 
influencing one another by conservative forces was discreetly 
removed to the background, and in which the object of the 
mathematical physicist investigating a definite problem was 
to determine the correct mathematical form to be assigned to 
the function Z in terms of the variables which define the 
physical state of the given system. In a word, the underlying 
idea was that the whole behaviour of any physical system could 
be, as it were, summarised in a definite mathematical expres- 
sion. It is but natural to inquire if there is any promise of 
similar success in the newer Physics, supposing an attempt is 
made to discover a suitable form for the function K (the 
“ kinetic potential ’’) of equation (2). 

At once we are faced with a vital question—Does the prin- 
ciple even in the case of purely mechanical systems accord with 
the postulates of Relativity ? We can hardly think otherwise 
considering that equation (2) has been deduced from equations 
of motion which themselves agree with these postulates. How- 
ever, it will be instructive to give a direct proof of the fact. 
Let us, therefore, investigate the conditions which must hold 
in order that if we transform from a frame of reference S to 
another frame S’, then the equation 


8/Kat + [Wat =o 


transforms into 
8/ K'dt' +. /8W'dt' = o. 


(Remember we are still postulating the restricted Relativity of 
a homaloidal world.) 

If the system be regarded as a group of particles, each par- 
ticle will have an individual path in any frame, and to each indi- 
vidual path will correspond a definite world-line. Now taking 
the frame S, let us consider the path of a definite particle and 
its neighbouring varied path. These will correspond to a world- 
line and a varied world-line. The correspondence of the points 
on the two paths will yield a correspondence of events on the 
world-lines such that (x1, %, %3, %4) corresponds to (x, + 8%, 
, + 4) %4 + 6%). Now our method of linking the positions on 
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“the paths by means of time, obviously means that the corre- 
spondence of events on the world-lines is given by the condition 


Ox, ='0. 


Let us indicate points on the actual path by A, B, etc., and 
the corresponding points on the varied path by Q, R, etc. If 
now we transfer our thoughts to the frame S’, we are faced 
with a temporary difficulty arising from our enlarged views 
concerning simultaneity. The points on the varied path in 
frame S’ corresponding to A, B, . . . on the actual path are 
no longer Q, R, .. . ; they are points Y, Z, . . . associated 
with events on the varied world-line which are related to the 
events associated with points A, B, . . . by the condition 


OX, = oO, 


and the condition 6x,’ = o does not in general agree with the 
condition 6x, 0. However, the difficulty is easily removed 
if we remember that since the force-activity four-vector P) is 
related to the momentum-energy four-vector by the equation 
Py = d(modx)/ds)/ds = mx, it follows that P,x, is zero since 
Nate IS zero. Suppose, therefore, x, refers to A, %) + 6%) 
to QO in frame S; let x,’ and x,’ + Ax,’ refer to A and Q 
in the frame S’; and let x,’ + 6x)’ refer to Y in frame S’. 
We have the conditions 


64 ==-0 
0 
but AO. 


By the well-known invariance of geometric products 


PO%g = Fa Akg: 
Now Ax,’ = 8%)’ + ex)’, 
where ¢ is some quantity of the first order of magnitude ; for 


Y and Q are on the same path (the varied one), and so the 
four-vector “ direction-cosines ’’ of QY are x)’. Hence 


ee Ong ery ve 
tr Ot 
Therefore 
POC Pe Ota 
When translated into terms of frames of reference this 
becomes, bearing in mind the conditions, 6%, = 0 = 6%,’ 


A(f . Sr) = p(t’ . sr’). 
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Now the infinitesimal interval of time dt in one frame 
between A and B is connected with the interval dt’ between 
A and B as measured in the other by the equation 


at/B = dt’ /p’, 
for each is the element of proper time. Hence 
(ff. orjdt=—=(f (or jade 


Thus since in each frame the total time of the path is 
unvaried, it follows that 


S(£ . 8r)dt 
ls invariant. 


There is no difficulty now in considering all the particles 
and proving that 
Swat 
is invariant. 
In order, therefore, that equation (2) may accord with the 
Relativity postulates, we have the condition that /Kdt should 


be invariant. Now in so far as K is obtained in the manner 
suggested by particle dynamics, this is certainly true. For 


K = — ym, (1 — v?)# 
= =m,/B, 


/Kat = — Xmfat]p} 
= — 2 {mo/ds} 


= — Z{mo(s; — S;)}. 


and therefore 


It follows that in the extension of the Principle of Stationary 
Action to physical systems in general an inevitable condition 
to be satisfied by the kinetic potential chosen for any particular 
problem is that the Action derived from such a choice must 
be invariant for all frames of reference. 

We are naturally impelled to inquire if a similar proviso 
has to be established in order to introduce the Principle of 
Stationary Action into General Relativity. But before pro- 
ceeding to deal with this very important question it will be 
instructive to apply the Principle to the case of a general 
dynamical system and to the case of the electromagnetic field. 

For a general dynamical system defined as above it follows 
from (2) that 


SZ (QL/2qi « 8g:) + VL/d7; . 87;) + TOSq;}dt = 0, 
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puce we write once more the customary symbol L instead 
of K. 

Also, as 987; = 8(dq;/dt) = d(8q,)/dt, it follows by steps 
which will be quite familiar to readers of works on general 
dynamics that 


PEL[2qi — a(2L [>r,) dt + Q,)8q,}dt = 0, 
and since 8q,, etc., are arbitrary variations, the equations 
d(dL/dr3)/at — dL/dq, = Qy, etc., 
are true. 
So Lagrange’s form of the equations of motion are valid in 
Relativity dynamics except that the kinetic potential L is no 
longer the difference between kinetic and potential energies. 


We can also arrive at a Hamiltonian form for the equations 
of motion. Just as for an individual particle 


d{— me(x — v9) }/at 


is equal to the momentum mv, so we can differentiate the 
function L for any dynamical system with respect to 7, 7», etc., 
and call the quantities so obtained generalised momenta, 
denoting them by #,, pe, etc., as before. Now consider a 
function which is equal to 


Pit + Pata t+... + der, —L, 


but which is expressed in terms of the co-ordinates and momenta. 
Denote it by H. It is easy to apply familiar methods and 
obtain 

dH/2q1 = — L/24, 


and  — 1 similar equations, together with 
H/p1 = 11 
and ” — 1 similar equations. 


From Lagrange’s equations it is possible to proceed by steps 
(which once more will be familiar to most readers) to the 
following form of the equations of motion : 


dq,/at = dH/d, 
and ” — 1 similar equations, together with 
dp, /dt = — dH/29. + Q1 


and ” — I similar equations. 
It is not difficult by transforming back to Cartesian 
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co-ordinates to show that H is equal to =m for the particles 
of the system, i.e., to the kinetic energy of the system plus the 
intrinsic energy of its component particles (which latter may be 
regarded as a constant during many physical processes). Thus 
the Hamiltonian function H turns out to be, just as in the older 
dynamics, the total energy of the system expressed in a particu- 
lar mathematical form. If a system is self-contained and 
isolated, the principle becomes 


8/Ldt = o. 


In extending the Principle of Stationary Action so as to 
cover the dynamics of continuous media, it is clear that the 
kinetic potential employed will be expressed as a triple integral 
whose integrand is the difference between the densities of 
kinetic energy and energy of strain. In this way it follows that 
the various elastic solid theories of Light put forward in the 
nineteenth century can also be summarised in the Action 
Principle. Finally, as was pointed out by several writers, 
notably Larmor, the equations of the electromagnetic field can 
likewise be expressed in a compressed form by equating the 
variation of a certain integral to zero. This integral will 
involve not only the co-ordinates and velocities of the charged 
corpuscles (electrons and nuclei) whose motion constitute the 
conduction and convection currents, but also some variables in 
terms of which the field quantities are expressible. The most 
convenient variables are found to be the components of 
the three-vector potential and the scalar potential, i.e., the 
four components of the four-vector potential. In the treat- 
ment of a dynamical system not only do the generalised 
co-ordinates appear in the kinetic potential, but also their 
differential coefficients with respect to the time; so in the 
treatment of the field not only do the components of the vector 
potential appear, but also their differential coefficients with 
respect to the four space-time co-ordinates. They enter, in 
fact, into the function involved in the kinetic potential through 
the six field components, or the components of the field tensor 
FF defined by 

F Carta yen f =i) 


If we now use the symbol L to denote the density of the 
kinetic potential, we have to consider how the equations of the 
field can be resumed by the equation 


S///L(A, ¥, x, x)\dxdydzdt = o, 
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under some limiting conditions to be determined, L being a 
BeNOR A yee a ged ey ed gg hy) Mg hy 
aewi: 

Since the differential element dx dy dz dt or dx, dx, dx3 dx, 
of space-time, which we shall in future denote by dw, is an in- 
variant, it follows that in order to satisfy the postulates of 
Relativity, the function ZL must also be invariant. 

Suppose now that the paths of the charged material particles 
are not varied, but that the forms of the functions which 
express the components of A in terms of %,, %2, %3, %4 are varied 
infinitesimally from those which actually hold in virtue of the 
laws of the field, the varied functions being A, + 6A). This 
variation of A will produce a variation in f given by 


oF = Curl 5A. 


We also assume that the variations vanish at the three- 
dimensional boundary of the extension in the world through 
which the integration is effected. (This corresponds to the 
vanishing of the virtual displacements at the initial and final 
states of a dynamical system.) Alternatively we can extend 
the integration to boundaries so far removed that the field 
components are zero beyond them. 

The vanishing of the variation of the action under the 
conditions stated yields the equation 


Sir [PAq . 84g + L/P ag . SF ap}dw = 0; 


for 8x =0=> 8%). 
This is equivalent to 


SPL/DAg . 8Ag + W/dFag . (238A g/d%q — 28A,/dx~)}dw = 0, 


interchanging variation and differentiation. 
By a small modification of the dummy suffixes this can be 
written 


SidL[Ag - 8Ag + (OL) pa — L/dF ag)d(8A q)/dap}dw = 0, 
and this works out to 


SPLPAg. 8Ag + d[(2L/IF ga — IL/dF ag)dA o)/d%s 
— §8A,d(dL/dF ga — IL/dFag)/dxp}dw = 0 


which reduces to 


JS PLA + PL/F ag — L/F ga)/d%p}5A dw = 0, 
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since the triple integral over the boundary vanishes because 
dA), is zero at the boundary. any 

Hence on account of the arbitrary nature of the variations 
5A), we have 


IL/dAy + I(L/¥Faq — L/dFaa)/d%q = O - 4) 
If now the function ZL has such a form that 
OL oF ap = $F xp ; : . (5) 


the equation (4) becomes 
IL/dAy + I ya/d%a = O, 


remembering the anti-symmetry of F,,, and this would agree 
with field equation of the Lorentz theory, 


J = Lor J, 
provided 
ILA, == — Jy. ; : ts), 


It is not difficult to show that equations (5) and (6) are 
satisfied if L has the form 


(F .F) — (A.J) — bo 
or 4 FupF ag ae Aula — Fo : : : (7) 


where py is the proper density of the matter in the system. 
The last term in (7) would correspond to the part of the action 
which would exist if there were no field, viz., — jigdw * for this 
is equal to — //ppd7,ds = — Z/mpds, agreeing with a previous 
result. It is easily seen that if we differentiate the above ex- 
pression with respect to Fa, we obtain $/,,, and if with respect 
to A, we obtain J,; for in both these differentiations there is 
no variation of the world-lines of the matter, and so py and J 
are constant. 

By imposing a variation under different conditions on the 
form (7) and equating it to zero, we arrive at the equations of 
motion for a charged particle as propounded by Lorentz. 
Suppose, for instance, that with unvaried values of A and fF, 
we subject the stream lines of the particles to a small variation, 
this will involve a variation of J and of jp, and hence 


S/Ldw = — fA. 83)dw — S/jigdwn, 


so that if the variation is zero, 


[A .83)dw = — Sfjigder. 
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Now it is not difficult to show that 
dJ = Lor [J . 8R] 


where 5R is the vector (8%, 5x5, 8%, 5x4), ie. the virtual dis- 
placements in space and (imaginary) time of an element of 
the current. For instance, 


8], = dJy/d%q . 8%q 
dJy/d%_ . 8%, — Div J . 8x) (since Div J = 0) 
(J 8% eae J a8%)/d%a, 
which is the A-component of Lor [J . SR]. 
Therefore 
NA. 83)dw = f(A . Lor [J . 8R])dw 
= —/Div[A. [J . SR]]jdw 
+/({J .8R] . Curl A)dw.* 


The integral of the Divergence can be expressed as an 
integral over the three-dimensional boundaries (using a four- 
dimensional analogue of Gauss’ Theorem), and this will be 
zero by reason of the zero values of the variations imposed there. 


Hence 
SA . 63) dw = /(Curl A .[J .dR})dw 
= —f({J . Curl A] . 8R)dw + 


= — f(T . F) . &R)dw. 


Now if we were considering the charged matter alone, apart 
from the field, as a dynamical system, its action would be 


— Spode, and by the proposition established earlier, 
— 8/u9dw + virtual work of the external forces on matter = o. 
Hence the virtual work of the forces on the matter 


= 8/pdw 

=f/([J . F).sR)dw 

= /io(f . 8r) — p(v . f)dt}drdt 
= fip(f . [Sr — vé])}drdt, 


f—e-+[v. hl. 


tl ll 


where 


* This follows from a theorem which can easily be proved : 
Div [A . 36] = (% . Curl A) — (A. Lor B). 
+ It is easy to prove that 
(A. (A. B]) = — ((A. A). B) = ((B. A]. A). 
17 
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Now since 5r — vét is the virtual displacement of a 
charged element to a point on its varied path for the same value 
of t, this result is consistent with the assumption that f or 
e + [v.h] is the force exerted by the field on the charged 
matter per unit of charge. 

Hence the value suggested for L yields, by the application 
of the Action Principle, the electromagnetic equations when 
one condition is imposed on the variations of the variables 
occurring in the function, and the mechanical equations when 
another condition is imposed. 

The three terms occurring in (7) can be associated individu- 
ally with the “‘ field,” the “‘ electricity,”’ and the ‘‘ matter ” re- 
spectively. The second term can be expressed in another form 
which brings out its association with the “ electricity ’”’ very 
clearly. Thus suppose we divide the charges into infinitesimal 
amounts de (which are, of course, invariant), and consider their 
world-lines ; then 


(A . J)dw = (Ajpv, + Apu, + Azpv; + tA gp)dx,dx.dxdx, 
== (pdx,d% ax (Adx,+ ... + Adx,) 
== ereCt gay.) 


where dx,, dX, dx3, dx, are the components of an element of 
the world-line of de. 
Hence 


NA. S)dw = fA . ds)de 


where the one integration is taken along the world-line of an 
element of charge de, and the other integration taken over all 
the charge in the system. 

In a similar way we can show that 


Spd = //dsdmy 


where ds is an element of the world-line of an element of matter 
whose proper mass is dm, the integration being along these 
world-lines and over all the matter in the system. 

This brief account shows that the Principle of Action, in so 
far as it was valid in the older Physics is consistent with the 
Restricted Principle of Relativity. The further steps needed 
to demonstrate its consistency with General Relativity are due 
to Einstein himself, and to Hilbert, Lorentz, Klein, and Weyl. 

Thus, in so far as the phenomena of the electromagnetic 
field and the laws of mechanics are concerned, it is extremely 
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easy to generalise expression (7). One important fact which 
we have to bear in mind is that in General Relativity the 
element of space-time dw is no longer an invariant, but qdw, 
where ¢g = ,/(—g), is so; in consequence the integrand in 
the action is no longer an invariant but the product of an 
invariant by g. Such a product is called a “scalar density ” 
by Weyl, and he also refers to the product of a vector or 
tensor by g as a “ vector-density ”’ or “ tensor-density ;”’ the 
integrals of the products of such densities by dw are scalars, 
vectors, or tensors. The expression which supplants (7) still 
consists of three terms. The first is now written 


igF apl’ ap. 
The second term in the action still stands as 


SaefA ate, 


and this can be converted into 
SGA a J “dw, 


dé = pod, = par, 


for 


by the invariance of the charge, and so 
dedx, = pdrdx,dx)/dx, 
—— OU yd 
= 7 ]*dw. 
Thus the second term in the expression replacing (7) is 
gA,J*dw. 
The third term in the action is still 


Sfdmds 


which will, of course, now involve the g,,-potentials. This 
can be expressed in an alternative fashion by writing for 
dm ds 
Aimy Lapa ,4Xp/ds, 
which is equal to 
AML apXax pas 
== [dT 9B apXaxpds: 
= WapporXax paw 
= ggapl “Bdw 
= qIdw 
’ where 7 is the contravariant matter tensor of Chapter XI. 
Lj 
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So far, therefore, as we know the expression for the action, 
we can write it as 


} /qFapFebdw — /defA dx, — [fdmeds 8) 
or SitFaF® —A,J*—T)do. .  .  - (9) 


As before, if we vary the electromagnetic potentials A), 
and in consequence the field tensor F,, we find, on reference 
to equation (8) of Chapter X., which gives the relation between 
Fw and Fy,, that the variation of the action is 


SQ(hF "SF ag — J°Aq)dw, . é . (xO) 
provided, of course, the gravitational potentials g,, are not 
varied in functional form, nor the world-lines of matter varied 
in position. 

Suppose, however, that we vary the ga,-potentials, but 
keep the A)-potentials and the world-lines of the matter 
unvaried, we find that the integrand of (9) is varied by 


2Fapd(qh*®) — gIogag, 


for, of course, the contravariant field tensor F*“ depends on the 
£a,-potentials. Now 


6(qF%8) = Fe83q — gFy.8(gr7g8*) 
= $9 F Ber dpe + 2gGF yegrIghng Sgn6, 


by an appeal to equations (6) and (3) of Chapter IX. 
Hence the variation in the integrand of (9) is equal to 


q(Fapk yg gong Sena + PF pl Pg7bey< — T*8dgag) 
= q(— g89FapF Seno + ZF ap 8 g7b gy. — TPdgap) 
= — q(T + E*8)8gag, 


where E* is the contravariant stress-momentum-energy tensor 
of the electromagnetic field introduced in Chapter X. 

In consequence, if both types of potentials receive arbitrary 
functional variations 6A) and 8gyu, it follows that the variation 
of the expression written in (9) for the action is 


SU4F SF ag — J°Aq — $(T8 + E8)8gag)dw, . (11) 


where we could make the identification so often signalised in 


the theory and refer to Ty, + E,, as the complete matter 
tensor. 


If now we wish to use the action function (9) and its varia- 
tion (II) so as to obtain the electromagnetic and mechanical 
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equations, we can adopt the following elegant method of pro- 
cedure, due to Klein. 

Conceive that an infinitesimal deformation is applied to 
the world continuum, which displaces the point-instant 
(1, Xa, %3, X4) tO (Vz, Va, Vs, Ya) Where 


Vy = %) + €X4, : ‘ = (12) 


e being a constant infinitesimal parameter and X* being a 
contravariant vector function of %,, %2, %3, %4. | We also con- 
ceive that the covariant vector potential A, receives an infini- 
tesimal variation im tts functional form which transforms it to 
B) in such manner that the invariant product (A . ds) for an 
element of an undisplaced world-line is equal to (B. ds) for the 
corresponding element of the displaced world-line, i.e., that 


A ,(x)d%q a B,(y)4ya, : 2 g (13) 


where we use A,(x), B)(x) to denote the values of A, and By 
at the undisplaced world-point (%1, %9, %3, 4), and A,(¥), By(y) 
to denote values at the displaced world-point (41, 2, ¥3, ¥4)- 
The variational symbol 8 is then to refer to the change at the 
undisplaced world-point, so that 


SAV We SA\ ae PD) 


This variation will involve a variation in Fay, for since Fay 
is defined by 
Pay = dA, /d%, — dAy/dx, : 3 (55) 


it follows that the new functional form for Fa,, which we shall 
denote by Hy,, is given by 


Je ky, = dB,/d% = WB)y/d%p. 


Employing (13), it is not difficult to show by means of the 
four-dimensional analogue of Stoke’s Theorem that 


Fag(x)dx,dxp = Hag(y)dyadyp. : G3) 
As before, 
OF yp, = Hay(*) — Fxlx): -. : ery) 


The varied functional form of the ga,, which we will denote 
by hay, is to be obtained by a process similar to (17), viz., it 
has to satisfy the condition that an element of separation in a 
world-line is not to be altered, i.e., 


8ap(*)d%qdxg = hap(y)dyadye ees 5) 
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and Sgn, is then defined by 


Sgau = Maul*) — Srul%). « . . (19) 

We shall denote the variation introduced into the function 

L by these changes in Ay, Fay, gaz by 5L, but we must be careful 
to note that the variation of /gLdw throughout a defined region 
of the world is not now /S(gL)dw, because there has been a 


deformation of the world continuum itself. The complete 
variation is, in fact, 


Jd(qL)dw + fd(qLX*)[dx,. dw, . . Go) 


as those familiar with the ‘‘ divergence’’ processes used in 
hydrodynamics and elasticity will recognise. 

The second term in (20) arises from the displacement of a 
small portion of the continuum across the original three- 
dimensional boundary. 

We can calculate the first term in (20) by writing, in accord- 
ance with (II), 


S(qL) = gqF oF ap — gJ%Aq — 3qT 0gap 
(where 7% now stands for the complete matter tensor), and 


replacing dF ag, 544, 584g by values calculated from (14), (17), 
(19). Thus by reason of (13) 


(Ba + eX8)B,/d%p) (dxq + «dX*/dxg . dxg) = Agd%q, 


where 5, 1s, Of course, estimated at (vy, 4.04704) 
Hence, neglecting the square of e, we obtain after a little 
modification of dummy suffixes, 


(8Aq + eX#dB,/duxg + eBgdX8ldx,)\dxq = 0: 


= taking the individual multiplier of dx, and going to the 

ae ye — (AN oey XDA i010 
Similarly we can prove that 

OP ay = — €(PyqgdX%/d%) + FaydX*/d%, + ADF yy /0%5) ~ (22) 

O2ap = — €(GradX%/d%) + Lay IX4/9%y + Akay [OXe)e~ (23) 


If we insert these values in the above we find after a rather 
tedious but not difficult collection of terms that 
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(GL) = ge{(SayT®Y — FayF8y + AqJ8)dX*/dx¢ 
+ (2187 0gey/d%_ — $F8Y0F gy/d%q + 7 J8dAg/dx_) Xf. 


The integrand of the second term in (20) can be written as 
GaP LIX*/dxg + X*d(gaFGL)/dx~. 
So if we write V8 for the mixed tensor 
Too —FuyFOv + AgJ@+ gL .  . (2d) 
we have the total variation of /gLdw, viz., 


—/dew{qV dX*/dxg + ($qT B70 gpy/d%q — 4QFPIDF py/d%a 
+ gJ®dAp/d%— + 2(8aPQL)/d%~)X*}. 


Now if we assume that this variation is zero for any arbitrary 
displacements of the world-points (subject, of course, to the 
provisos concerning the variations in A, and ga, laid down 
above), i.e., for arbitrary values of X and its derivatives, the 
individual multipliers of )X¢/dx%g and X* must be zero. Hence 
we have 

V yy =0; 
or 
Py = Fyre — A, JP — eel. 5 cS (25) 


To obtain the required equations now is not difficult, for if 
we assume that the electromagnetic field equations are true 
(and they can be derived from the Action Principle by a suit- 
ably restricted variation, i.e., one in which the ga, do not 
vary, but only the A)), we can, by dropping out the terms 
above which involve the field quantities, and which give a 
zero result individually because of the field equations, obtain 


&/4[T PdX*)/dxg + (kT dg8y/d%q)X*]dw = 0, 
ie., /{( — d(qTu8)/d%— + 4qT 80g 8y/d%q)X* 
+ d(qT8X*%)/dxg\dw = 0. 


If, as usual, the region of integration is assumed to be 
bounded by a three-dimensional region where 7 is Zero, 
i.e., where matter does not exist, the divergence part of the 
integral can be transformed to a boundary integral which 
vanishes identically. Hence it follows that 


g~19(gT )%)/d%q = 4Td808/0% ), 


which is just equation (4B) of Chapter XI., because 7)" now 
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includes the electromagnetic field energy-momentum-stress 
tensor E)“ as well as the “ pure ’’ matter tensor. 

We have seen in Chapter XI. that the mechanical equations 
are actually a part of Einstein’s law of gravitation—they can be 
deduced from it. It is but natural to inquire if this law of 
gravitation can itself be brought within the ambit of the 
Principle of Stationary Action. If that were so, Physics would 
have achieved a purpose always implicit in the work of its 
devotees, it would have, in the words of Weyl, ‘“‘ reduced the 
totality of natural phenomena to a single physical law.” Of 
course, the accomplishment of such a purpose could not be 
claimed until we had discovered a mathematical form for 
the action function in terms of the fundamental variables of 
the fields agreeing completely with our knowledge of the 
real world, and no such claim can be made at the moment. 
We shall, however, conclude this chapter by expounding two 
methods of bringing the law of gravitation under the Principle 
of Action, one due to Einstein and one to Lorentz. 

What Einstein was able to show was that his law of gravita- 
tion within matter could be deduced from the proposition that 
the integral 

Sq(G — 8axT)dw 
is stationary or 
8/q(G — 8rxT)dw = 0, 


where T is the scalar invariant of the matter tensor, G the 
scalar invariant of the Einstein tensor G),, regarded as a 
function of the contravariant fundamental tensor g*“ and its 
first and second order differential coefficients. (The constant 
« is the one introduced in Chapter XI.) 

To prove this we begin by putting a limitation on the 
co-ordinate system which will be removed later; the deter- 
minant g is to be equal to minus unity. 

Now conceive a finite extension in the world (i.e., the history 
during a finite time of a limited region of a space frame of 
reference). We consider the g and g, as fifty variables 
whose values at each point-instant define the condition at that 
point-instant and its rate of change (spatial and temporal).* In 
any definite system of reference, the values of these variables at 
each point pass through a definite sequence of values between 
two definite instants, provided the initial conditions at the 
first instant are assigned, i.e., provided we assign the initial 


* In what follows g, stands for )gd«/)¥, and gy, for d?gAH/)4, 4x. 
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Cistribution of energy, momentum, and stress. At any point- 
instant the value of the function y introduced in (7) of Chap- 
ter XI. is calculated, and from it is obtained the value of the 


integral 
JN ax 14% AX 3% 4 OF Sxdw 


throughout the finite extension in space-time which is 
bounded by a three-dimensional manifold which we will denote 
by M;. This boundary will consist of the region of the space- 
frame *, = a constant, k, which is bounded by some definite 
surface in this space-frame, #(%,, %2, %3) = 0; the region at 
the end of the sequence of phenomena in x, = k’ bounded by 
the surface with the same equation, and the tubular three- 
dimensional manifold generated by the world-lines of the 
points on the two-dimensional bounding surface in the space- 
frame. The “initial” and “final’’ instants are, in fact, 
Ze A he == Rh 

Now we might conceive that the g’» and g,\ had slightly 
varied values g** + dg and g,A¥ + dg, at each point-instant 
which are not consistent with the law of gravitation, and if 
these values were introduced into the integral above, a slightly 
varied value of it would be obtained. The variation would be 
given by integrating dy or — {By, a}dg,87 — {Be, O}{y8, e}dg87 
(making use of equations (9) of Chapter XI). 


But dgqFY = d(dg87)/d%q. 


Hence the first term in the integral can be converted by an 
integration by parts into 


Sir{By, a}/d%q . 8g81 — d({By, a}8g47)/dxq]dw. 


By an analogue of Green’s theorem, the second term in 
this can be converted into an integral over the boundary M;, 


such as 
SnakBy, adhd M 5, 
N14, Nz, Nz, NM, being the “ direction cosines ”’ of the line at any 
point of M@, normal to it.* 
* If the equation of M, is 
f(% Ho, %3, 4% 4,) ==.0; 


m = of/om - (E(df/d%)") — 3, ete. 


then 
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Hence it readily appears that 


8/xdw = /bpydg8idw — [nf By, a}dg87dM 3, 


where ¢ is the degenerate form of Gy, when g = — I. 
But by Einstein’s law, if we restrict the co-ordinates to 
satisfy g = —1, the first integral in the right-hand side is 


equal to 
— 8ax/(T py — 29,1) dghrdw. 


The second term of this is zero, for gg,dgS7 is zero provided 
gisconstant. The first term is equal to 


8ire/[gVT py — 8(g97T py) dew 
= 87 /(g88T py — ST) dw. 


Collecting terms, we see that 
S/ix + 87xT)dw = 8a /gb1ST gydw — /na{By, a}dg87dM 5. 


Now we suppose (as in the Principle of Least Action in 
general dynamics) that the initial and final “ configurations ” 
are given, i.e., that the varied history starts from the same set 
of values for the g*“ at each point within (%,, %2, x3) = 0 in 
the space-frame at x, = k as the actual history ; and also ends 
in the same set as the actual history at x, =k’. Wecan also 
regard the surface #(%,, %s, x3) = 0 as so far removed from 
matter during the history that the g# have their constant 
Galilean values there. 

As a consequence of these provisos the triple integral 
vanishes, since 6g*# is zero at all point-instants in M3. If the 
quadruple integral is to vanish we must assume the distribution 
of energy, stress, and momentum to be the same during the 
varied history as during the actual. This is the analogue of 
the assumed constancy of the energy in one of the forms by 
which the Principle of Least Action is expressed. As a matter 
of fact, constancy of time, k’ — k, and of energy-stress-momen- 
tum are postulated here; and with these conditions satisfied 


S/x + 8rxT)dw = 0, 


or the quadruple integral has a stationary value in the actual 
history of the gravitational system as in any neighbouring 
arbitrarily varied history. 

It must be observed, however, that the co-ordinates have 
to be chosen to agree with the condition g = —vz1. This 
restriction can be removed easily, as follows : 
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X — SPr{By, a}/d%q = x ~ (ghr{By, a})/Ixa + gaFY{By, a} 
= — x — d(g47{By, a})/dxq 


by equations (8) and (9) of Chapter XI. 
aus by the definition of y, the left-hand side is g87¢g, or 4, 
and so 


S/pdw = — 5/xdw — a boundary integral throughout M,, 


which will vanish as before. 
Thus with the same conditions as previously, 


5/(6 — 82xT)dw = 0. 
If we now consider the integral 
SUG — 8rxT)dw, 


the integrand is the product of two invariants, viz., G — 87«T 
and gdw. Hence the value of the integral is an invariant, and 
so if its variation is zero under arbitrary variation of the g 
and g,** in any one frame of reference, it must be zero in all. 
But this is so for frames in which g = — 1. Hence with no 
restriction as to choice of co-ordinates, but under the ‘“‘ boundary 
conditions ’’ imposed above, we have 


S/4(G —8axT)do=0 . . « (26) 


as the most general expression of Einstein’s law of gravitation 
and dynamics. 

Lorentz and Hilbert attempt to complete the action function 
as expressed above in (9) by introducing an additional term to 
represent the action of the gravitational field. In (9) we have a 
term involving A, representing the ‘‘ substance action” of the 
electricity, a term involving linear functions of the first deri- 
vatives of A, to represent the “ field-action ”’ of the electricity, 
and a term involving ga, representing the substance action of 
mass. Apparently all that is needed to complete the function 
and round everything off with mathematical elegance is to add 
a fourth term to represent the “ field-action”’ of the matter, 
i.e., gravitation; such a term should involve linear functions 
of the first derivatives of the ga,, the most natural functions to 
choose being the [Ap, v] or {Ay, v} indexes of Christoffel. But, 
unfortunately, we know of no invariants which involve only the 
£au or g“ and their first derivatives. We have, of course, the 
contracted Riemann tensor Ga, and its scalar invariant 
G(= g*®Gag) ; but these involve the second derivatives of gay 


linearly. 
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However, we can make shift by the well-worn device of 
introducing a partial integration and discarding a divergence 
integral which can be reduced to a boundary integral vanishing 
at the limits of the region of integration. 

Stated formally, the action is now written 

SUL +L,+L; + L,)dw. 
(a) L, is the substance action of the electricity, and is 
—A,J*. 
(b) Lis the field action of the electricity, and is 
1 Fgh 48, 
(c) L is the substance action of matter, and is 
nas Lapl 8. 

(d) On referring to (17) of Chapter IX., it will be seen that 

we can divide the tensor G,, into two parts : 
a, B}{HB, ay — tru, a}{aB, BS 
and dfra, a}/dx, — dIfAu, a}/dMXq, 


the first part only involving the first derivatives of the gay 
(or g“), the second part involving second derivatives. So the 
scalar invariant can also be split into two parts: 


gr{ya, B}{eB, a} — g7*{ye, a}{aB, B} 
and gy df{ya, a}/oxe — grdf{ye, a}/dIXq. 
We take L, to be a multiple of the first part, so that 


L, = khg({ya, B}{eB, a} — {ye, a}{aB, B}) 


where k is some numerical constant, so that L, involves only 
the functions {Ay, v}, ie., the derivatives of ga, to the first 
order, but no higher. But L, is not an invariant.* 

However, if we consider the variation of the integral 


SQL dw, 


it can be shown by the device of integration by parts now 
familiar to the reader that it is equal to 


k/qK apog’ aBd ey 


* The reader will observe that this splitting of G,, and G is quite 
different from the splitting into ¢,, and y,. In fact, L,4 is not invari- 
ant even for a co-ordinate system satisfying the condition g = — 1. 
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where Ka, is the tensor 
* $8auG — Gay, 


so that although the term_/gl ,dw of the proposed action func- 
tion is not invariant, its variation is, and, after all, although 
the desire for complete mathematical elegance is not satisfied, 
the necessities of the physical principle are. 

It will now be obvious to the reader by a repetition of 
former steps that 

1° if we vary the electromagnetic potentials A, alone and 
put the variation equal to zero we obtain the electromagnetic 
field equations. 

2° if we vary the gravitational potentials alone and annul 
the variation we shall obtain 


Kap — Tan — Eny = 0. 


A reference to (22) or (22A) of Chapter XI. shows that this 
is Einstein’s gravitational equation, provided we absorb as 
usual the electromagnetic energy-tensor into the matter tensor 
and identify the constant & with (87x) —1. 


CHAPTER: XU 
SOLUTION OF EINSTEIN’S GRAVITATIONAL EQUATION. 


At this point it would appear advisable to turn aside for a 
while from the general question of the conformity of the laws 
of Nature with the principle of General Relativity, and deal 
briefly with the mathematical problem involved in solving the 
gravitational field equations proposed by Einstein. No general 
solution has been obtained so far, but a small number of par- 
ticular solutions have been discovered. A brief summary of 
the calculations involved will be given in this chapter, but 
readers interested in the general literature will be well advised 
to consult the following original papers : 

“‘ Sitz. Preuss. Akad.,’’ 32, p. 688 (1916), by Einstein. 

“ Sitz. Preuss. Akad.,” 7, p. 189 (1916), by Schwarzschild. 

“‘ Sitz. Preuss. Akad.,’’ 18, p. 424 (1916), by Schwarzschild. 

‘““Monthly Notices of the Roy. Ast. Soc.,” 76, p. 699, by 
de Sitter. 

“ Proc. Amsterdam. Acad.,’’ 19, pp. 197 and 447, by Droste. 

“ Ann. d. Physik.,” 54, p. 117 (1917), by Weyl. 

“ Phil. Mag.,” 40, p. 703 (1920), by Wilson. 

“ Phil. Mag.,”’ 41, p. 823 (1921), by Hill and Jeffery. 

A very small acquaintance with the applications of Rela- 
tivity to physical problems reveals the necessity for an altered 
attitude of mind towards the conception of co-ordinates. It 
is manifest in the fusion of space and time. It is brought home 
to us still more markedly at the outset of our immediate 
task. Consider, for a moment, the manner in which we have 
been accustomed to approach the solution of definite problems 
in gravitational mechanics, electrostatics or electromagnetic 
theory. We have, of course, to attempt the solution of a 
certain differential equation, Laplace’s equation or the equation 
of propagation, subject to certain boundary conditions. We 
begin by “choosing a system of co-ordinates,’ they are 
Cartesian, polar, cylindrical, ellipsoidal, toroidal, or any other 
convenient curvilinear co-ordinates which happen to suit the 

270 


EINSTEIN’S GRAVITATIONAL EQUATION 271 


boundary conditions laid down. In each system the equation 
has a distinctive form’ which the mathematician recognises as 
peculiar to that system. If we obtain a formal solution for the 
potentials, scalar and vector, or for the intensities in terms of 
the independent variables, we experience no feeling of doubt 
or difficulty as to the physical interpretation of the solution; 
for we have at the beginning assigned a definite meaning 
to those variables in “‘ choosing ’’ the system of co-ordinates. 

But matters are on quite a different footing if we propose to 
ourselves a similar problem in gravitational mechanics, taking 
Einstein’s equation as our expression for the laws of gravitation. 

Its outstanding property is its invariance, i.e., no matter 
what system of co-ordinates we “‘ choose,”” the equation has no 
distinctive form which we recognise as peculiar to that particular 
system. The result is that if we succeed in obtaining a solution, 
i.e., in discovering functional forms for the g-potentials which 
satisfy the equation, we have no direct guide to a physical 
interpretation of the solution, for any so-called “choice of 
co-ordinates ’’ would have been pointless as a preliminary to 
the purely mathematical problem involved in the solution. 
Such guidance must, in fact, be sought for from the form of the 
solution itself, and not from the form of equation. The most 
obvious clues will be found in the presence of certain singu- 
larities, in the absence of certain variables, and in the degenerate 
form of the solution when the approximations sufficient for 
Newtonian Dynamics are introduced. 

Bearing these considerations in mind, let us proceed to 
obtain an exact solution of the equation “ outside matter ”’ 

Gu =.0 
in the manner first indicated by Schwarzschild. It so happens 
that the clues mentioned above indicate that when physically 
interpreted it is the mathematical expression for the field of a 
single gravitating body. Of course, we can temper somewhat 
the rigorous relativist attitude of mind indicated above by a 
preliminary speculation concerning the form we might expect 
such a solution to take, if we adopted the system of co-ordinates 
which our acquaintance with Newtonian theory would suggest, 
viz., polar co-ordinates. At a great distance from the origin 
where we suppose the centre of the gravitating body to be 
situated, the element of separation between two events has a 
value whose square is given by 
— 872 — 7°80? — r? sin? 056? + df?, Pie bs) 

‘where the usual significance is attached to the variables 7, 0, d, t. 
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It is plausible to assume that at a finite distance from O, 
the value of 5s? would be given by an equation 


Ss? = — y,5r? — y7956? — yr? sin? 056? + x,5/?, . (2) 


where ¥1, Xs, X3, X4 are functions of the variables. 

Analogy with the well-known solution in Newtonian theory 
and the consideration that the field is ‘‘ stationary,” i.e., not 
changing with lapse of time, would suggest that none of these 
functions would depend upon ¢, and that of the space co-ordi- 
nates, ~ alone would appear in their formal expressions. 
Furthermore, comparison with (1) would show that each of 
them would approach the value unity as 7 approaches infinity. 
And, finally, if the solution is to be symmetrical around the 
origin (a very natural hypothesis), and therefore independent 
of an orthogonal transformation of axes, the functions y, and 
x3 should be identical. 

As stated, these are but speculations as to possibilities. 
What we must now do is to discover if such a solution can be 
found. 

Einstein’s equations written fully are: 


d{Hv, a}/d%u — (ma, BHYB, a} + {uv, a}d log g/d%q 
— d® log g/dx,%, = 0. (3) 


Now it is obvious that if we obtain one solution of this 
equation, we, at once, can obtain any number of solutions we 
please by transformation of co-ordinates. (This is not to be 
taken as saying we can obtain a general solution.) We can 
make use of this fact by imposing a certain simplifying restric- 
tion in the solution to begin with, which may be removed later 
by a transformation. Several such restrictions have been 
suggested by different investigators. The one suggested by 
Einstein, and also adopted by Schwarzschild, is 


Gal (= 6) Tip a) cae) 
If we introduce this, (3) becomes 
d{uv, a}/dx. = {ua, B}VvB, a}. , x (5) 


In order to make the adoption of condition (4) as convenient 
as possible, Schwarzschild uses variables x, %2, %3, %4, which 
are later interpreted as follows :— 


%,=75/3, %.=— cos, 
X3 = d, X4 = ibs 
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The point of this interpretation i is this. Ata great distance 
the element of separation is from consideration of (x), given by 


— (3%) — 88x,? — (3x,)#(r — X 9%) — 18% 5? 
— (3%,)#(1 — x,%)dx%,2 + 84,2. . : : . 1(6) 


In these variables g,; = — (3x,) —', etc., and the condition 
(4) is satisfied at a great distance from the origin, which is not 
the case for the variables used in (1). At finite distances from 
O, the equation (2) transforms to 


— fx(%1)8% 1" — fo(%1)(I — %9") —18%_? — fa(%1)(I — %52)5x,2 
+ fia(%3)%4? (7) 


where (remembering that x, = x3) 


fi > (3%1) ~ Fas %, > 
fe > (3%1)3 ”? ” 


and fiz I ” ” 


(8) 


and the condition (4) implies that 


Sifts = I. . . . - (9) 


Thus we have 
ees a2 Catia shan SAL — ha") Bea J 
fay = Oif p = v 


Also, a simple calculation gives 


s = f2fi/g = — iff, since g = — 
e = (L —-4e)/f a, ee ey 5 et4 = I/f, 


eg = Oif w+ v. 


i 


The next step is to calculate the Christoffel symbols. A 
considerable number of these are zero. Thus it is easy to show 
that [uv, 1] = 0, if w+ v; the four surviving [p, I] are: 


[11,1] = — 34f,/dx, 

[22, 1] = 44f./dx, . (I — %,")—* 
[33, 1] = 34f2/dx, . (I — **) 
(44, I] = 34f./dx,. 


Of the [uv, 2] symbols four are not, zero. They are: 


(12, 2] = — 4df,/dx, = [21, 2] 
(22, 2) = Fetal — 43) * 
(33, 2] = — fo%a 

18 
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Only four of the [uv, 3] and two of the [y», 4] survive, viz. : 


(13, 3] = 


[23, 3] = feve = [32, 3 
[14,4] = alae, = (at, 4). 


It is now a straightforward matter to calculate the symbols 


of the second kind. They turn out to be as follows :— 


120; 1} a af dx, (Ahaha 


{22,1} = — df,/dx, . [af,(I — %2")]— 
{33,1} = — df,/dx, . (I — %2")(2f,)~* 


{44, 1} = dfs/dx, . (2f;)—} 

{12, 2} = df,/dx,.. (2f.)—1 = {21, 2} 
{22-2 ye Xia( Ti — 2057) a 

{33, 2} = x2(I — %,") 

{13, 3} = df,/dx, . (2f2) —1 = {31, 3} 


{23, 3} = — x,(I — ae 1523) 


{14, 4} = df/dx, . (2f4)—1 = {41, 4}. 


The remaining {uv,A}-symbols vanish. 
Inserting these in equations (5), it will be found that if 
p+ v they are identically satisfied. So the equations become 
four in number, 


>{up, a}/dva = {ua, BYuB, a}, 


— df ,jdx, .(1 — %_*)—? = (31, 3] 


where pu is not summed but put equal to I, 2, 3, 4 in succession. 
For convenience, write $, = log fj, etc., and we find after 
a few steps that these equations become 


(= 1) d%ps[dx2= 


( 
(u = 4) d(er— 


= 3) This repeats equation (11). 
1d 4/dx1)/dx, = e%— t1(dd,/dx,)%. . 


Also from (9) we obtain 


dp,/dx, + 2dp,/dx, + dp,/dx, = 0. 
From (12) we obtain 


so that 


and 


ad ,/dx,? = ad,/dx, . Ad 4/dx, 
dd,/dx, = der; 


af jax, = afi fy . 


where a is an integration constant, 


(4p 4/4 )*+ (dp o/dx3)?+-43(dd4/dx,)?. . 


(u = 2) d(ers— tid /dx,)/dx, = 2 + 0% —%(dbaldx)? 


(15) 
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Adding (10) and (14), we have 
a*(P, + $4)/dx,? = (dd,/dx)? + $[d(b, + $4)/dx4), 
so that by means of (13) we obtain 
2d*./dx 1" +- 3(ddbs/dx,)? = 0. 
An integration of this yields 
ddb,/dx, = 2/(3%, + 0) 
where 0 is another integration constant, so that 
2 = §log (34, + 4) +6, 
or fe = ¢(3%1 + O)8, 
c being a third integration constant. 
The conditions (8) require c to be unity, so that 
fe = (3% + OF. ‘ Ate) bus), 
Referring to (15), we see that by (9) 
af,jdx, = alf,? ; 
ae al(3%*4 — b)5, 
Rg ABN tO) 8 oe S(T) 
since fy >I asx—>o. 
From (9) it follows that 
fi, = 1/(3%1 + 4)[(3%1 + 0)? — a]. . - (18) 


We have not made use of equation (II) in obtaining these 
values for f;, f., and f,, but it is easy to show that it is satisfied 
by them. ; 

To sum up, Schwarzschild’s solution of Einstein’s equation 
as 


and thus 


— (3%, + 6)—1[(34%, + 0)#—a]-} 
Sata Ut = 4) (x9) 
Bas = — (3%, + O)8(x — x52) ae 
Ga, = I — (3%, + b)—# 


Before proceeding, it may be as well once more to emphasise 
that this solution, as such, is the result of purely mathematical 
reasoning, and is in no way dependent on any physical inter- 
pretation of the variables. We are not even bound to assume 
that the variables do actually represent the co-ordinates or 
-functions of co-ordinates, which we introduced above as an 
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assistance towards obtaining this solution. It is, however, only 
natural to make this assumption and test its conclusions, i.e., 
interpret x, as 73/3, %, as — cos 0, x, as ¢, and %, as?. 

If we do so, and for convenience introduce the quantity Rk, 
which is equal to (7? + 6)?, we find that 


8s? = (1 — a/R)8t2 — 8R2/(x — a/R) — R802 — R? sin? 6342(20) 


The astronomical consequences of this solution will be 
discussed in the second part of this chapter, where it will appear 
that comparison of observation with calculation is not so 
precise as to indicate whether in (20) R should be put equal to 
y or not. 

This is but another aspect of that vagueness about the 
interpretation of variables which can only be cleared up in this 
case by sufficiently refined measures. True, if we care to make 
the problem a little more precise, and by adopting a procedure 
which is common in Newtonian dynamics, say that our gravitat- 
ing body is a “ particle,” i.e., possesses no dimensions, we can 
remove some of this indecision about the meaning of our 
co-ordinates, for in such case we could naturally expect our 
solution to be finite and continuous everywhere except at the 
origin, y= 0. A glance at the second term on the right-hand 
side of (20) shows us that this is not so unless a8 = b; for Ris not 
equal to a, where vy = 0, if a3 + 6. Without some such proviso, 
however, there is no obvious relation between a and 6. The 
most natural physical interpretation for the constant a is that 
it is a measure of the “ strength ”’ of the gravitating body, and 
it will appear that it is equal to twice the Newtonian gravitating 
mass as far as any available measurement can tell. 

But as for 0, apart from the necessity of satisfying some extra 
condition such as that mentioned above, there is nothing to 
decide its value. It must be undoubtedly small compared, say, 
to the actual diameter of the sun. If we choose to put it equal 
to zero, we obtain 


ds? = (1 — a/r)dt? — 5r?/(1 — alr) — 7°50? — 7? sin? 0862, (21) 


the equation actually used in discussing the astronomical 
evidence. This, of course, is a solution which can only be 
considered as valid outside a sphere of radius a, and so would 
suggest a size for a gravitating ‘“ particle’ closely connected 
with its mass, a conclusion which is not absurd, but is not 
verified by any known result. 

But, of course, as stated above, there is really no limit to 
the number of forms which may be suggested for the element 
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of separation in the field of a single body. Thus, reverting for 
the moment to (2), viz., 


— x57? — Xo(7°50? + 7? sin? 034%) + x54 


(using the condition y, = x3), we see that Schwarzschild’s 
solution satisfies the condition 


XiX2"Xs = — I, 
which is really Einstein’s condition, g = — 1. 
Now Hill and Jeffery have recently shown that an exact 
solution can be obtained which satisfies the different condition 


S.6Ne 3 Bay = DEX 


The details of the solution can be worked out in a manner 
similar to that given above, and are to be found in the reference 
at the beginning of the chapter. But, as a matter of fact, Hill 
and Jeffery’s solution can be obtained from Schwarzschild’s by 
the transformation 

y= 7,(I + a/47,)?, 
b being considered as zero. On dropping the suffix after the 
transformation, we find that 
X1=X2 =X = (I+ a} ; (22) 
Xa = (I — a/4r)?/(I + 4/47)? 
It is not difficult to obtain forms of the solution suitable 
for Cartesian co-ordinates, for since 
dv? + 77862 + 7? sin? 0562 = dx? + Sy? + 82? 
and (vdr)? = (xdx + ydy + 262)?, 
the solution (21) can be written 
(xr — a/r)dt? — (dx? + dy? + 62?) 
— a(xdx + ydy + 262)?/73(1 — a/r). 

This gives a scheme of g-potentials which can be easily 

written down, e.g., 
= — (I+ a'x*/7*), 24, = — a'xy/r*, etc., 


where a’ = a/(I — a/r). 
Hill and Jeffery’s solution can be Emily transformed into 
the rather elegant Cartesian form 


(x — a/4r)(x + a/4r) — 2dt2 — (I + a/4r)*(5x* + dy? + 982%). 
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The latter also gives a very useful approximate solution, 


ViZ., 
(1 — 2)d¢2 — (1 + 24)(dx? + dy? + 82%), 


where ¢ is the usual Newtonian potential function. As a 
matter of fact, Eddington makes a very interesting and 
ingenious use of this form in order to obtain Einstein’s equation 
inside matter. The details are in Chapter VI. of Eddington’s 
“ Report,” and the reader will find it instructive to compare 
this method with Einstein’s own, which was given in Chapter XI. 
above. 

For further details, and especially for solutions of the 
equations at a point within matter, the reader should consult 
the references given earlier. 


ASTRONOMICAL CONSEQUENCES OF EINSTEIN’S EQUATION. 


Equations of Motion of a Particle in the Field of a single 
Gravitating Body. 


It will be rather more convenient to use the “ Lagrangian ”’ 
form of the equations of motion, viz., 


A(dL/d%,)/ds — dL/dx, = 0 : 3) 
where ; ; 
2L = yi? — y— 37? — 7°02 — 7? sin? 04? 


and we employ the more usual symbols instead of %1, %», %3, %4 
and write y for 1 — k/r, k being a constant. 
The fourth of the equations (23) is 


a(yt)/ds = o. 
Hence yf = a constant = 8, 
or bds = (1 — R/r)ot. : ‘ a (2a) 


The third of the equations is 


d(dL/d4)/ds — IL/34 = 0, 
Le, d(r* sin? 6¢)/ds = 0 
so that v? sin? Of = a constant. 


We can, without loss of generality, take this constant to be 
zero, and put 


dd/ds = 0, 
or ¢@ = a constant. 
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This means that the motion is in one plane. 
Turning to the 6 co-ordinate we have 


d(dL/06)/ds — dL/90 = 0. 


Now dL/d0 = — r* sin 6 cos 04? 
= 
Therefore v?6 = a constant, 
or Pavids = hh... : ; : : . (25) 


This result is the analogue of the law of equal areas in 
Newtonian theory. 

There is no necessity to trouble about the remaining 
equation ; (24) and (25) combined with the form for ds? are all 
we require to obtain the differential equation of the orbit. 

By (24) 

dt/ds = bly. 
Hence, using the form of $s?, we obtain 
y(b/y)? — y—1(dr/ds)? — r*(d0/ds)? = 1. 
Using (25) we have 
(dr[ds)? + yh?/r? = b? — y. 
But (25) also shows that 


drjds = h/r? . dr/dé. 
Hence 
h?/y* . (dr[d0)? + yh?/r? = b? — y. 


As is usual in the mathematical analysis of motion “ under 
central forces,’’ we introduce a symbol for the reciprocal of the 
radius vector, 1/7. Call it wu. Then 


(du/d6)2 + (x — ku)u® = uh[h? + (b? — x)/h?. 


This gives a differential equation for the orbit in terms of plane 
polar co-ordinates ; but a further differentiation with respect 
to 6 will give us a differential equation differing by one term 
from the familiar second order equation for motion round a 
centre attracting as the inverse square Differentiating in this 
way, and cancelling out du/d@ from each term, we obtain 


d?u/d0? + u — sku? = k/2h*. : 720) 


As mentioned, we are accustomed in Newtonian theory to 
the equation for an orbit in the form 


aufde?+u=Mhr* , : nn) 
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where M is the gravitational mass of the attracting body. As 
Newtonian theory is such a good approximation, we can avail 
ourselves of this to identify the constant & in Einstein’s orbit 
equation as 2M, so that numerical tests become possible. So 
we write (26) as 


au/de? + u— 3Mv= M/h*. . «1 (2G) 


The exact solution of (26A) involves elliptic functions, but 
on account of the smallness of the third term in comparison with 
the others, in the case of any of the planetary orbits in the solar 
system, an approximate solution in trigonometrical functions 
sufficiently accurate for immediate purposes can be obtained.* 
Before working it out, we shall exhibit the orders of magnitude 
involved. 

The earth’s mass is about 5°5 times the mass of an equal 
globe of water (6400 kilometres in radius), i.e., about 6 x 1074 
metric tons, or $ of the Relativity astronomical. unit of mass.f 
The sun’s mass is about one-third of a million times the earth’s, 
ie, about 15 x 10° Relativity units, or M —=1°'5 xX 10%, 
approximately. The average distance of Mercury, the inner- 
most planet, from the sun is about 60,000,000 kilometres, or 
6 x 1012 cms. So for the orbit of Mercury Mu (ie., M/r is 
about 2°5 X 10-8). In Newtonian theory h?/M is the latus 
rectum of the ellipse obtained from equation (27), and so 
h?/M is of the same order of magnitude as the radius vector 7. 
Consequently the extra term in (26A), 3Muw?, is about 10~7 of 
any of the terms in the Newtonian equation, and so in a solution 
we can neglect terms involving the square of this ratio. 

Writing (264) as 


d*u/d6? + u= A+ 3M u?, 


where A = M/h? and is the reciprocal of a length, we see that 


if we neglect the term 3Muwu?, we obtain as an approximate 
solution 


u = XI + ecos 8), 


* For an exact solution, see a paper by Forsyth (‘“ Proc. R.S.,” 
A. 682 (1920)). 

} In Newtonian theory 2M/r is the square of the velocity ‘‘ from 
infinity,’’ and so units of M must be chosen to suit our unit of velocity, 
which is ¢ cms. per sec. Consequently, 7 being in cms., M is not expressed 
in the usual astronomical unit of gravitational mass (that which concen- 
trated at a point attracts another unit with force 1 dyne at a distance 
rcm., about 15 X 10° grams or 15 metric tons), but in an enormously 
greater unit, viz., c* times the usual astronomical unit or 1:35 x 1078 
grams, or 1:35 X 10” metric tons, 
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which is the polar equation of a conic with \ as the reciprocal 
of its latus-rectum and e as its eccentricity, the initial line 
being so chosen as to pass through the perihelion of the planet, 
where 7 has its minimum value and uw its maximum. 

To proceed to a second approximation, we put this value 
in the term 3/1? of (264), and obtain as the differential equation 


d*u/d0? + u=X-+ 3M + 6MA?%e cos 0 + 3MA?2e? cos? 6 
=A+ 4+ vcos6é+ «cos 26, 


where 
p= 3M (1 + 4de?)A? 
v = 6Mer? 
Kk = 32Me?)2. 


ft, v, « are obviously of the order 10-7 A at most, v and x 
being even smaller if the eccentricity of the approximate 
ellipse is small, as is the case for nearly all the planets in the 
solar system. 

This is a well-known type of differential equation, and its 
solution is 


u = (A + p)(t + ecos #) + dr0 sin? —4xcos20 . (28) 


as can be verified by trial. Of course, we could write any 
arbitrary constant where we have written é, so far as satisfying 
the differential equation is concerned. Naturally, however, 
the solution must degenerate into the Newtonian ellipse if we 
neglect w, v, and x. 

Equation (28) represents an orbit slightly perturbed from 
the ellipse 

u = (A+ p)(I + écos 8) 


by amounts depending on the remaining two terms on the 
right-hand side. Of these, the term 36 sin @ produces ulti- 
mately the greater effect, for it may, during the course of the 
(1 + 1) revolution, after the initial instant, acquire a value 
+ nvr, which increases with 1, while the other term never ex- 
ceeds $« numerically. 

The interesting point is to determine where the perihelia 
occur. In the case of the ellipse, they occur at 6 = 0, 27, 47, 
etc. (1, 37, 57, etc., corresponding to aphelia). To obtain their 
angular position in the perturbed orbit, we must choose the 
alternate solutions of 

du/dé = 0, 
1.€., 
f(A + pe — Zr} sin 6 — 3x sin 20 = 40 cos 0. 
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There will be a solution near 9=2mn7; so putting 
§ = 2nm7 + a, we obtain on writing a for sin a, and unity for 
COS a, 
{0+ we —» — 4x/3}a = nvm, 
or practically 
a/n = mrv/de | 


mao (29) 


Equation (29) gives an advance in the perihelion per revolu- 
tion of the amount 6MAz radians. Now M = h2),, and X is 
the reciprocal of the latus-rectum of the approximate ellipse, 
which is equal to b(1 — e?)t, where 0 is the minor semi-axis. 
Also, referring back, h is practically twice the area swept out 
per unit time by the radius vector, i.e., h = 27ab/T where a is 
the major semi-axis and T is the period of a revolution (in 1/c 
second as unit). 

Hence the advance of the perihelion per revolution is 


247°a2 
T(r — e?) 


This result has received a most remarkable confirmation 
in the case of the planet Mercury. Its value when multiplied 
by the number of revolutions in a century is 43 seconds of 
arc. By Newtonian theory the perturbing action of the 
remaining planets should cause the perihelion of Mercury to 
advance 532” per century. The mass of any one of these planets 
is so small compared to that of the sun, that the treatment of 
its gravitational field at Mercury by the Einstein method 
would yield results only differing from those obtained by the 
earlier method by amounts much too minute to be perceptible. 
Hence Einstein theory would still give 532’ as the perturbing 
action of the other planets. But the observed value is 574’ 
per century. It was Le Verrier who first pointed out the 
discrepancy, and no explanation entirely acceptable to astro- 
nomers seems to have been forthcoming, until Einstein pointed 
out that the difference was accounted for by his treatment of 
the gravitational field of the sun. By its action alone on his 
view there is an advance of 43’’, to which is added, of course, 
to the 532” due to the other planets, and close agreement with 
observation is thus obtained. The arresting feature of the 
result is the absence of any adjustable constants in the formula. 
The values of a, T, e are well-known astronomical data. Of 
course, there must be similar advances in the case of the other 
planets, but they are smaller, being about 8” per century for 


radian. . ; ; . (30) 
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Venus, about 4’’ per century for the Earth, and still smaller 
values for the remaining planets. Apparently, it is next to 
impossible to test these results. It seems that astronomers do 
not directly observe the advance, but the product of the 
advance and the eccentricity. This product is about 8” in the 
case of Mercury, whose eccentricity is ‘2, but the eccentricities 
of Venus and the Earth are so small that the products in their 
case are only about one-twentieth of a second. 

It will be as well to give a little further thought to this 
result before proceeding. In (28) we have the polar equation 
of an orbit. Interpreted in the usual way 7 represents the 
distance from the sun, @ the angle from a certain initial line. 
We might consider this orbit drawn absolutely correctly to 
scale on a large sheet of paper. A series of adjacent points 
P,, P., Ps, . . ., Pm could be marked on the curve ; each one 
would have an appropriate 7 and 6, and by means of (24) and 
(25) each one could have an appropriate ¢ marked beside it. 
Yet we on the Earth should not actually obtain ¢, — ¢, as the 
time of passage of the planet between the positions represented 
by P, and P, on our plans; we should obtain y(t, — ¢,). 
Neither should we get 7, — 7,, as the radial displacement ; we 
should get (7, —7,)y—*. Our measure of the displacement 
transverse to the radius vector would agree with the plan. 

This is so, because if dt’, 57’, 7’50’ are our measures made in 
our own terrestrial frame, we must write 


ds? == dt'2 — 67’2 — 7/266"2 
(ignoring the ¢ co-ordinate as of no importance in a plane 


orbit), because our axes are Galilean and our instruments are 
fixed in this frame. So to obtain the form for 6s? we must put 


dt’ = ytdt 
OY = 47 40r 
750’ = 780. 


This reveals a lack of agreement between our measures 
and the plan. There are two ways of describing this discre- 
pancy. One is to say that ¢ is the time which would be given 
by a clock at rest in the solar gravitational field, and not acceler- 
ated like ours, and 7 and @ are measures obtained by rulers, 
protractors, telescopes, etc., also at rest in the gravitational 
field. Compared to such a clock our actual clocks go slow, 
because, since y<1, they give a smaller number than 9¢ for 
their measure of a certain interval of time ; compared to such a 
- ruler, our actual ruler, if held parallel to the solar field, is short, 
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since it gives a larger number than 6r for a certain radial 
measure ; if held transverse to the field it agrees. This way 
of looking at the matter appeals undoubtedly to our feeling 
that we must have something tangible for gravitation to exert 
an effect on. It no longer is a matter of exerting force on a 
body, but the field exerts an influence on our metrical relations 
with bodies in the field, which we phrase most easily for our- 
selves by saying that it makes our clocks go slow and shortens 
our rulers if held parallel to the field. But, of course, such 
running slow and shortening has nothing absolute about it—it 
must be purely relative to clocks and rulers disposed in a highly 
unnatural manner, and assumed to be free from what is other- 
wise a universal behaviour. In this connection it is very 
essential to bear in mind that it is the Newtonian potential 
and not intensity of gravitational force which determines the 
effect on our measures. 

The other way of looking at the matter is one which cannot 
be explained without some preliminary discussion on the 
possibility of our space-time continuum being curved or 
“warped.” But if, instead of drawing the curve given by 
equation (28) on a plane sheet of paper, we drew it on a 
curved surface, the lengths of geodesic lines from the focus to 
the curve being represented by 7, we might by properly choosing 
the surface make our measures of length fit with the dimensions 
of the plan; but the difficulty about the measured time would 
still remain. We shall return to this matter later. 

Let us once more take up the analysis and consider the 
velocity of propagation of light in the field. This is determined 
by the condition 5s = 0, i.e., 

yot? — y— 1672 — 72602 =0 . : . 3x 
in a given plane. 

Hence if v be the velocity of light as determined by the 
co-ordinates 7, 8, ¢, 

v2(cos? w + ysin? f) = 2 . : k(32) 
where % is the angle between the direction of the light at a 
point and the radius vector from the gravitating centre to the 
point. 

Along the radius vector this gives 

v=y=1—2M/r. 
At right angles to 7 it gives 
(hs yt 
= 1 — M/r approximately, 
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This gives the velocity as different in different directions ; 
but, of course, this velocity is not the actual measure made by 
observers, who are naturally always in a Galilean frame. We 
should again have to postulate observers unnaturally fixed 
with their instruments in the field. Actual observers get 
d¢’(= ytdé), and not & for the measured time of the light 
between two points, and 7’80’(= 786) and 87’(= y~ 46r) as the 
components of the distance between the points, so that for them 
the speed of light is 


(S72 + 7'280’2)4/52’, 


which is unity. Yet in dealing with celestial phenomena, such 
as planetary motion, we refer an orbit to the sun as centre, 
and so we choose axes with the sun as origin, and referred to 
these axes the path is no longer the ideal ellipse with planetary 
perturbations superposed. In a similar manner, if we choose 
the sun as origin, the path of a ray of light through the solar 
system is no longer straight if we derive the co-ordinates of 
points on it from measurements made by us on the earth, using 
Einstein’s law of gravitation and the values of the potentials 
inferred fromit. For since vis variable, the wave from it must 
slew round. Let us calculate the bending for a ray of light 
which passes a gravitating centre at a nearest distance a. 

Choose axes so that OY passes through A, the nearest point 
of the ray to O, where the centre is situated, and so that OX is 
parallel to the direction of the ray at this point. Choose 
two consecutive points P and P’ on the ray, and let PQ and 
P’Q’ be elements of normals to the ray, so that Q and Q’ are 
consecutive points on an adjacent ray. The angle between 
PQ and P’Q’ is the amount of deviation of the ray between 
P and P’. This angle is 


(QQ’ — PP’)/PQ. 


If v + Sv is the velocity of Q and PP’ = 3s, so that the 
time between P and P’ or Q and Q’ is 6s/v, this angle is equal to 


dvds/(vd2) 
where 
on = FO: 
Hence the amount of bending of,the ray is equal to the 
integral 
Jo-idvfdn ds . ; : 2. £33) 


. between assigned limits. 
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Now by (32) 
v(I — 2M sin? w/v)? = 1 — 2M/7, 


or v=1—2M/7r+M sin? $/r 4- higher powers of M/r (34) 
= 1 — M(1 + cos? #)/7 + etc. 34 


Combining (33) and (34) will give the required result. In 
the case of a ray passing near to the sun, M = 1°5 X 10°, and 
a (taken as the sun’s radius) is approximately 8 x 10°, so that 
the maximum value of M/r is about 2 xX 10-5. Asv—1dujdn 
is of the order M/r?, it is clear that all but a negligible part of 
the bending is contributed by the part of the ray lying between 
two points equidistant from A and situated at a distance, say 
1ooa, from it (roughly half the distance of the earth from the 
sun). The bending is so small that the ordinate y of any point 
on this portion of the ray may be assumed to be a without 
practical error; also ds may be replaced by dx and d/dn by 
d/dy in (33). Hence the total bending of a ray coming from a 
great distance, passing the sun and travelling to a great distance, 
is equal to 


2fv- 1v/dy . dx 
= 2/ [M(x + cost #)/r]/2y . dx 
= 2/s(Mir 4+ Me2/r8)/ay . dx 
= 2M/lajr + 3ax%/r) de, 


since 07/0y = 4/? = a/r. 
Putting 
“= atan o 
vy = asec 0 
dx = asec? 648, 


we see that the bending is equal to 


t/2 
2M /a . /(cos 8 + 3 sin? 6 cos 6)d0 
0 
= 4M /a. 
It is interesting to notice that a bending of light would be 
quite conceivable on Newton’s corpuscular theory, and, in fact, 


speculation on such a possibility did engage his attention. 
Were the hypothetical corpuscles subject to the gravitation of 
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stellar bodies according to his law, they would travel along a 
branch of a hyperbola of very great eccentricity in passing such 
a body, the speed of the corpuscle increasing as it approached 
the gravitating mass, the very reverse of Einstein’s conclusion. 
Further, it is easy to show that in this case the bending would 
be 2M/a, half of that predicted by Einstein. For example, 
the bending would be the acute angle between the asymptotes 
of the hyperbola. Now on Newtonian theory 


v? = 2M/r + the square of the velocity at infinity 
= 2M/r-+1. 


But v? is also 2M/r + M/R where R is the major semi-axis 
of the hyperbola. 


Hence R= M, 
But a= (e—1)R, 
where ¢ is the eccentricity. 
Therefore e=1+a/M 
= a/M practically. 


Since the acute angle between the asymptotes is 2(e2 — 1) —4, 
or 2/e practically, we obtain for the bending 2M//a. 

It can also be shown that, using the electromagnetic theory 
and assuming that radiation has a mass subject to gravitational 
action, the result 2M/a is again obtained. (See Eddington’s 
“ Report,” Chapter V., pp. 55, 56.) 

In the case of the sun, 4M/a radian is equal to 1°74 seconds 
of an arc, and it is to this value, and not the half of it, that the 
results of the British Eclipse Expeditions of I919 give great 
support.* 

A third crucial test of Einstein’s work on gravitation has 
exercised a good deal of careful attention both on the theo- 
retical and experimental sides. It is claimed that the theory 
predicts a shift of the Fraunhofer lines in the solar spectrum 
towards the red by amounts equal to those which would be 
given by a radial velocity of recession of 0°63 kilometre per 
second. Now not only is the experimental evidence on this 
matter indecisive at the moment, but some physicists such as 
Larmor, Jeans, and Cunningham have expressed doubt as to 
the validity of the deductions of the result from Einstein’s 
theory. The deduction itself begins by considering some 
periodic mechanism situated at rest ‘in axes attached to a 
gravitating body. Referred to Galilean axes in motion at the 
locality of the mechanism with the acceleration characteristic 


* Completely confirmed by the American Expedition investigating 
the eclipse of September, 1922. 
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of the place, its period is 8, let us say, and the mechanism has 
a displacement 5x, Sy, 6z in these axes during the time 6. 
Then, as we have seen, 


8t2 — Sx? — Sy? — 822 = 10X27 +. . . + 28340%30%4 


where 6x, is the period of the mechanism in the gravitational 
frame, and 8x,, 6%», dx, its displacement in the time 6%,. Ifthe 
mechanism is at rest in the latter axes, 6%, = 0 = 8x, = 6x, and 


522 — bx? — dy? — 622 = g4,6%,?. 


The left-hand side of this equation is 6/2(1 — v?), and its 
square root is the proper time of the period, i.e., the period as 
determined when the mechanism is at rest in a frame free from 
gravitation. Now suppose that an exactly similar mechanism 
is situated at rest at another place in the gravitational field 
%1', Xe’, x’, then we can prove that the proper time is also 
equal to the square root of 

844 0X4" 


where 5x,’ is the period of the mechanism in this new position.* 
Hence 


2 — td , 
Bg0% 2 = Bqq'd%,'? 


and so the periods of the mechanisms would differ if situated 
in different parts of a gravitational field, although they would 
agree if they were brought together. 

In making a test of this conclusion, it is assumed that the 
radiating mechanisms in two atoms of the same element, 
sodium, for example, or cyanogen or nitrogen, are such 
mechanisms as we have referred to, i.e., that for all atoms of 
one kind the interval 5s for a given spectral line is the invariant 
thing which remains unaltered by displacement from one place 
to another. It is this assumption which has had to endure a 
certain amount of adverse criticism. Choosing axes fixed in 
the sun, and considering the first atom as fixed on the surface 
of the sun, and the second on the earth, we have, making use 
of the expression for 6s?, 


(1 — 2M/r,)dt,2 = (1 — 2M/r,)dt.2 — (1 — 2M /r,) — 187.2 
where 573, 7,58, is the earth’s displacement in the time 8f,. 


* The reader is warned that the accented symbols refer to another 
event at a different place in the same axes, and not the same event 
referred to another system of axes. 
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In this equation, 7, being the sun’s radius and 7, that of the 
earth’s orbit, it appears that 2M/r, is negligible compared 
with 2M/r, ; further, 87,/8¢, and 7,86,/8¢, being the components 
of the earth’s velocity, their squares are also of the order M ifs, 
and so are negligible. Practically, we have 


dt, = (1 — 2M /r,)*8t, 
= (1 — M/r,)3t,. 


That is, the period of the terrestrial atom is a little less than 
that of the solar, and so the solar lines should show a displace- 
ment to the red as compared with the terrestrial. The quotient 
of the solar mass by its radius (in Relativity units) is about 
2x Io~%, In that part of the spectrum in which investigation 
has been made, the displacement should be about ‘oo8 Ang- 
strom units. As solar lines are subjected to many disturbing 
influences such as the pressure-shift, the Déppler effect due to 
ascending and descending masses of gas for which it is difficult 
to make allowance, the practical testing of this conclusion is 
beset with every serious difficulty. In order to eliminate 
spectral displacements arising from differences of pressure 
between the sun’s reversing layer and the terrestrial arc, special 
attention has been paid to the lines in the cyanogen band at 
A = 3883, which are known to be free from the pressure effect. 
Further, since ascending and descending currents of gas would 
cause no first order Déppler effect at the sun’s limb, observa- 
tions have been made at the sun’s limb as well as at the centre 
of the disc, and the Déppler effect due to the sun’s rotation has 
been avoided by employing the parts of the limb near the sun’s 
poles. But there still remain unsettled points such as the 
weight to be ascribed to results derived from weak lines, and 
as compari d with those derived from strong, and the question 
of a displacement of the cyanogen lines towards the violet due 
to the proximity of metal lines, a matter which has received 
some attention in the researches of Grebe and Bachem. While 
the results of St. John in America in 1917 are distinctly 
unfavourable, those of Evershed in 1918 and Grebe and Bachem 
in 1919 offer considerable support. A mean of all the known 
experimental results, which have sufficient weight, gives ‘004 
or half the calculated value. 

The position is certainly not satisfactory, as the deduction 
is certainly not convincing to some minds, and the experi- 
mental evidence gives no decided lead one way or the other. 
It should be stated that Einstein himself maintains that his 
whole theory of gravitation stands or falls by the success or 


19 
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failure of this test; on the other hand, some physicists like 
Jeans maintain that if we do not insist on the special inter- 
pretation of ds made by Einstein (which we will meet later 
when treating of the four-dimensional space-time continuum), 
the theory can still hold, and embrace the extended planetary 
motion and the curvature of light ; it will become “less rich 
and less inclusive, but at the same time less beset with diffi- 
culties.” Those interested wiJl find this matter dealt with 
very fully in a discussion on Relativity held by the Royal 
Society, and printed in the “ Proceedings,’’ Vol. A 97, No. 681, 
March, 1920. 

The pages of ‘‘ Nature’’ during the early months of 1920 
also contain some interesting articles and letters in which this 
question, among others, receives a good deal of attention. In 
this connection it should be mentioned that in the same volume 
of this journal alternative explanations of the Einstein curva- 
ture of the light ray are put forward, e.g., the possibility of 
a refracting atmosphere round the sun, or, as suggested by 
Professor Anderson of Galway, a refraction due to the passage 
of the light on its way to the camera into the (possibly) cooled 
column of air in the eclipse shadow. Both these suggestions 
are adversely criticised (and in the writer’s judgment success- 
fully disposed of) by Crommelin and Schuster. 

The matter of the spectral displacement is so important that 
an alternative, and somewhat more elementary way, of stating 
the proof may not be out of place, which avoids bringing in the 
symbolism of the proper time and exhibits rather more clearly 
the special parts played by the general principle of Relativity 
and the principle of Equivalence as distinct from one another. 

Suppose we have two identical sources of light at points 
A, and A, on the axis OX of a frame of reference free from 
gravitation. An observer at O obtains the same period T for 
each. If we now consider an observer situated in a frame of 
reference moving uniformly with respect to the former with a 
velocity “ parallel to OX, he would also observe that the 
periods are equal. They are, of course, no longer JT, but 
T/a(i + %) by Chapter II.; but they are still equal to one 
another, and this equality is independent of the relative velocity 
of the frames. If General Relativity is true, the equality should 
still persist even if the second frame of reference were acceler- 
ated along OX. No doubt the periods would be varying 
according to the Déppler effect, but the observer would perceive 
equal frequencies at one instant. In fact, in the first frame, 
the front of a train of waves from A, arrives at O simul- 
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taneously with the front of a train of waves from A,, and the 
end of the first train also arrives simultaneously with the end 
of the second. But the coincidence of two events at the same 
time and place in one frame remains a coincidence in any 
other frame. So these two synchronisms remain as syn- 
chronisms in any other frame ; no doubt the interval involved 
between the arrival of the front and back of each train will 
alter with the frame employed, but in a given frame it will be 
the same for each. Thus far we are employing nothing but 
the principle of Relativity. Now comes the principle of Equiva- 
lence. In the accelerated frame the sources are moving with 
the same acceleration parallel to OX, either to or from the 
observer at O, and maintaining a constant distance between 
them (i.e., relative to an observer fixed on either). Hence, if 
Equivalence is true, these two sources falling in a field of gravi- 
tation will emit light beams which will have the same frequency 
as received by an observer fixed in the field. The frequencies 
will no doubt be increasing as time goes on, on account of 
increasing speed of the sources towards observer, but at any 
one instant they are equal. Let T now stand for the common 
value of this period as received by the observer at a given 
instant when the sources are at OA, = 4%, and OA,=x%, 
respectively, and their common speed is v. But the actual 
period of A, is not calculated by Déppler’s principle to be 
T(i + v), but T(r + v — gx,); for the light now reaching O 
was emitted at an instant x, earlier (the speed of light is taken 
as unity, and we are omitting quantities of the second order), 
and the velocity of A, then was v — gx,, so the true period of 
the first source, i.e., the period of the light emitted by it ¢f a 
were fixed at A, would be 


f,=1(1 + 0 — gx). 
Similarly, the true period of the second source zf fixed at A, 
is 
T= 1(1-+ v — g%,). 
Hence, as v, 2%, £%_ are of the first order, 
T, = T,(1 — gi), 


where ] = x, — x,; or the period of the source at the lower 

potential (A,) is longer than that at the higher potential. It 

is noteworthy that while we have ignored certain factors of 

the second order in v, etc., the ratio of the two periods involves 

unity minus a Newtonian potential difference, in analogy with 
soy 
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the factor 1 — M/r arrived at in the fuller treatment of the 
solar gravitational field. In fact, as already stated, it is the 
difference in potential between the two places, as distinct from 
intensity of gravitational field, which determines the difference 
in period of sources fixed in these places. The field might be 
very weak at and between both places, but provided there is a 
large enough difference of potential, the difference in period 
would be marked. When put in this way, there seems to be 
little hope, as Eddington says in his ‘“‘ Report” (p. 58), “ of 
evading the conclusion that a displacement of the Fraunhéfer 
lines is a necessary and fundamental condition for the accept- 
ance of Einstein’s theory ; and that if it is really non-existent, 
under conditions which strictly accord with those here postu- 
lated, we should have to reject the whole theory constructed on 
the principle of Equivalence.* Possibly a compromise might be 
effected by supposing that gravitation is an attribute only of 
matter in bulk, and not of individual atoms; but this would 
involve a fundamental re-statement of the whole theory.” 

In the discussion referred to above, Cunningham suggested 
that the failure to detect the spectral shift, if confirmed, would 
involve some difference between a simulated gravitationa! field 
such as can be entirely transformed away, and a real gravita- 
tional field which is associated with the presence of matter and 
cannot be transformed away in its entirety, but only locally. 
Such a difference would be of such a nature as not to interfere 
with the similarity of the real and simulated fields as regards 
motion of matter or transmission of light. It is certainly note- 
worthy that the difficulty has arisen in connection with emission 
of light, i.e., atomic processes, and as Eddington points out, if 
the displacement of the solar lines be confirmed, it will be the 
first experimental evidence that Relativity holds for quantum 
phenomena. 

As regards other solutions of the equation for a single body, 
one advantage of Hill and Jeffery’s result is that it makes the 
velocity of light at a point independent of the direction of the 
ray, the value being 


(r — a/4r)/(x + ajar)’.  . .(35) 


But it must be carefully remembered that this new expres- 
sion of the separation element leads to no contradiction of the 
known physical measurements, which, be it noted, all concern 


__* The italics are mine. We could still, for all that, retain the prin- 
ciple of Relativity in its most general form. 
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the angle @ and not the co-ordinate 7, about which there is a 
certain indeterminateness in interpretation. Thus (22), leading 
to (35), still gives 4M/a for the deviation of a light ray to the 
order M/a, which is all that can be observed. For the path of 
a particle in the field, (22) gives the equation 


d?u/d0? + u — 6M?u/h? = M/h2, 


which, though different to the equation (26A), derived from 
(21), nevertheless yields the same result,6M?z/h®, for the advance 
of the apse line per revolution. The reader, who is interested 
to pursue this matter a little further, will find some instructive 
and critical remarks by M. Painlevé in the ‘‘ Comptes Rendus ” 
(24th October and 14th November, 1921). This writer explores 
the possibilities of considering a still more general form for 8s?, 
such as 


(x — Rif(r)) 542 — f(r) (86? + sin? 684%) — f'2(r)(x — Rif(r)) ~ 387°, 


where ¢ is interpreted as the radius vector and /(7) is an arbitrary 
function which approaches the value 7, as 7 grows larger. He 
points out that in the actual state of our measures, nothing 
decisive can be established from a consideration of trajectories 
alone. Only very precise observations of times, such as the 
durations of the years of each planet, will yield any data which 
can have a decisive influence on our choice from the alternative 
expressions for 6s?; and even then the planets will have to be 
without satellites if we are to avoid the formidable mathematical 
difficulties raised by the problem of three bodies. Some remarks 
by Dr. Dorothy Wrinch, bearing on the question generally as 
to the correct application of the scientific method in the 
dilemma occasioned by such criticisms as those of Painlevé, 
will also be found very helpful by the reader with a philosophical 
mind. They appear in “ Nature,” 23rd March, 1922. 


PROPAGATION OF GRAVITATIONAL WAVES. 


The field of a single gravitating centre, as expressed in any 
of the solutions obtained above, is a static field from the point 
of view of an observer in the same frame of reference as the 
centre. The field of a number of centres, such as a planetary 
system, will alter with time, i.e. the solution for the g-potentials 
wili involve x, as well as %,, %2, and x3. This is, of course, also 
true in Newtonian theory. But whereas in the latter case the 
conception of an absolute space and an absolute time is vital 
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to the theory (for the potential at an assigned point and an 
assigned instant is calculated from the simultaneous positions 
of the gravitating bodies, and so gravitation is regarded as an 
influence which is propagated at an infinite speed), it cannot be 
so in any law of gravitation consistent with the Relativity 
principle. This point can be further developed by reference 
to an approximate solution of his equation for a general dis- 
tribution of energy, stress, and momentum obtained by Einstein 
in the form of a group of integrals resembling those which appear 
in classical electromagnetic theory as “ retarded potentials.” * 
From what has been stated in the preceding pages, it appears 
that in general the potentials at any point-instant differ from 
the Galilean values : 


ee 10) O O 
O == 1h O O 
O 1) == 18 O 
O oO O aL 


by quantities of a small order of magnitude. Let us write 
Suv a Ouy a Nyy, 


where dy, is the Galilean value of the »-component (so that 
So = 0 1f pay? Oy — Tih py 1, 2, OF SO ee 
if uw = v = 4), and hy, is a quantity of the first order of magni- 
tude. In what follows we shall neglect squares and products 
of the h,,-functions, regarding them as quantities of the second 
order. To this approximation it can be shown by a short 
series of straightforward steps that 


Guy = — dI{pv, a}/IX_ + dI{ua, a}/dx, — {uv, a}{aB, B} 
+ {ua, B}{vB, a} 

= — 20[8°P(28yp/d%y + Myup/éy — Bus] d%p)]/O%a 
+ $0(g%7 0gap/ 9%») /DXp. 


On the right-hand side we can replace gu” by 54” to the same 
degree of approximation, where $1! = $22 = §33 — — 1 and 
644 = 1 and the others are zero. We shall write h,” for g’Itug, 
or practically 8’%/y2. Thus 


hy Saale hus, fhe awe hus, hs ae hus, hy4 = Aus 
and h = hy* = — hy — hog — gg + Naa. 
It follows that 


* Einstein, ‘ Sitz. Preuss. Akad.’’ (1916), pp. 688-696 ; ibid. (1918), 
PP. 154-167. 
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Guy = 3(5°3d*tuy/dXqd%B "9 d7h/ dX pdIXy == Ny /IXpdIXg 
- —' 07y%/d%yd%q) 
= FO + YPH/ dx pdxy — Why! dxpdxX_ — Phu*/dxy%_) (36) 


where we write (J for the operator 
— 07/0x,? — 92/dx%5? — d2/dx, + d2/dx,2. 
Now define f,, by the equation 
Suv = Iu — 48h 


Hie pak o>.) 8B 
and assume that /,, satisfies the four conditions expressed by 
the equation 

Wu.t/I%_ =O. . : : (38) 
If this assumption be satisfied, then we see from (36) that 
Guy a 3 fie huy, 
and therefore 
Guy = 42 uvG =4 OC fu. : : : (39) 


As a consequence of (39), Einstein’s gravitational equations 
(224 of Chapter XI.) become 


Ofu = —16mKTy. . : a \40) 


The validity of these equations depends, of course, on the 
possibility of satisfying the four relations (38), and this is 
tantamount to choosing a special frame of reference. instein’s 
general equations (22) or (22A) of Chapter XI. are covariant 
for any arbitrary transformation. On introducing a new co- 
ordinate system the g,,-potentials of the new system depend on 
four arbitrary functions which define the transformation of 
co-ordinates. These four functions can be chosen in such a 
manner that the g,, of the new system can satisfy any assigned 
four conditions. Thus for our present purpose these four 
functions must be chosen in such a manner that the guy or hy 
satisfy (38). That is, by choosing a special co-ordinate system, 
we can ensure the truth of equation (40) to the first degree of 
approximation. Those who are familiar with the solution of 
the wave equation in electromagnetic theory in terms of retarded 
potentials, will recognise at once that the solution of (40) is 


to — "Aiden dr. |. (43) 


In (41) dz is a volume element of the three-dimensional 
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space (for which *,, %2, x3; represent point co-ordinates) at a 
distance y from the point where fy is being estimated, and 
[Tw] is the value of the component of the matter tensor not at 
the instant «,, when fy is being estimated, but at the instant 
%,—y7; that is, at an instant earlier by the interval of time 
required for hght to travel from the element dr to the point 
where fy» is being determined. This solution determines gy 
also, for by (37) it follows that 


= 5°Bh ag — 2h 
= —h, 
and so 
Ny = Tuy aay 46 uvf 
= — 4 fff [Twn dt + Bur fff{T]lr.dr . (42) 
and 


Buv = Suv + hu. 


One obvious conclusion is that gravitational changes should 
be propagated with the speed of light, but it must be very 
definitely borne in mind that this conclusion is valid under the 
two very restrictive limitations introduced above, viz., that we 
are considering the matter from the point of view of a specially 
chosen frame of reference, and we are neglecting squares, etc., 
Of hiyy. 

An important fact which Einstein discovered is that certain 
types of plane gravitational waves transport no energy, and so 
can hardly be said to have a physical existence. To understand 
the steps by which this conclusion can be reached, we recall 
that the Einstein equation of conservation, viz., 


(Tu), = 0, 
or 
(gL u2)/d%— + 2qT apdg**/ dx, = 0 
can be written in the form 
AT * + ty%)/dx_ = 0, 


where t,, is defined by (10) of Chapter XI., provided the system 
of co-ordinates chosen satisfies the condition g = —1. Now the 
condition (38), which we have just been imposing on the 
co-ordinates, is entirely different from the condition Pe i 
so we cannot draw the same conclusion concerning the expression 
of the conservation law without further investigation. The 
conclusion can clearly be justified to the approximation neces- 
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sary if we can obtain a new expression for t,y which, subject to 
condition (38), will satisfy 


ob yu2/dX%_ = $T apdg%8/d%u 
= 47 ,edh*8/d%x, 
= — Bre Cfaa(2f28/dx, — 35° D/)>x,) 
= 81rd7)*fag/IXydXe . (If %B/I%, — $8%8df/d%,). 


To the degree of approximation involved it can be shown 
that the following expression for t,, will satisfy this equation : 


327Kt yy = fo8/d%u . dfp%/d%, — $df/d%u. If/d%y 
= Bu Dye(Ofa8/Dxy . Yfp*/%e — Ff [Hy . Y//d4%e), - (43) 
Su? = 8*fua, 
so that 
Sub = — fuv fe = — fax fe = — fuss fut = fos 
To the same degree of approximation 


T wy a Syadrpl ” 
= Syadvppdx,/ds . dx,/ds, 


where 


and thus 7,,, 749, 743 are numerically equal to the components 
of the density of the energy-stream within matter. So ¢4;, tgs, 
t43 represent the density of the energy-stream in the gravitational 


field. 
In the case of a plane wave propagated along the axis of 


x, with unit velocity, we have 
Sur = Oy 6(%1 + Xa), 


where the a,, are ten constants and ¢ is a function of the 
argument x, + x, As we are using the specially chosen system 
of co-ordinates defined by (38), it follows easily that 


Gy, + 414 = 0] 
Gg 1 O24 = O} 
patie ae pe , ; » (4) 
41 + G44 = O. 
By means of (43) and (44) we can determine the energy 
stream components. They are 
tay = (327k) ~ 1{[(aee + 59)2/4 + Gos7]h'}?. - (45) 
tye = 43 = O. 
In this ¢’ represents the first derived function of ¢ with 
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respect to its argument. Following a classification by Weyl, 
we can group plane waves into three types :_ 

Type 1 are those for which the ay, involving I or 4 in the 
suffix are zero. These are called “ transverse-transverse 
waves. 

Type 2 are those for which the ayy are zero except 19, 43, 
O94) G34. These are called “ longitudinal-transverse’’ waves. 

Type 3 are those for which the ay, are zero, except a41, O14, 
a4, These are called “ longitudinal-longitudinal ’’ waves. 

It is clear from (45) that plane waves of types (2) and (3) 
transport no energy, and also waves of type (1), for which 
Geo dg, = O atid a, — 0. 


ADDENDUM. 


On account of the severe restriction on the choice of co- 
ordinates imposed by (38), the question of propagation in any 
natural system of co-ordinates is left rather undecided by 
Einstein’s analysis. As this book goes to press a paper by 
Eddington (‘ Proc. R.S.,’’ A 102, Dec. 1922) has just appeared, 
which treats the matter from a wider point of view. He con- 
siders first of all plane waves 


Tae == Ouvp(%4 = V%,), 


where the ay, are constants, V is a velocity of propagation, and 
the co-ordinates are general. These are propagated in the 
negative direction of the axis of x; Using the same approxi- 
mations as before, he works out the components of Riemann- 
Christoffel tensor, and from them those of the Einstein tensor 
Guy. Equating the latter to zero, he finds that such values of 
the gy potentials can exist in the field provided the following 
seven equations are satisfied : 


(I — V*)an, = 0 (46) 
(r — V?)as3 = 0 (47) 
(I — V*)a.3 = 0 (48) 
Gee + O33 = 0 (49) 
Ao4 = ee (50 
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We classify these waves into three groups as before. Each 
group represents a disturbance which can be propagated inde- 
pendently of the others. Equation (49) shows that for a TT 
WAVE Goo, G33, G3 Cannot all vanish; hence V = 1, or TT plane 
waves are propagated with the speed of light. If, however, 
LT or LL waves exist, equations (50), (51), and (52) show that 
the value of V in each case depends on the coefficients of the 
disturbance, and exhibits no sign of approximating to the 
velocity of light. These waves have therefore no fixed velocity 
and, as Einstein discovered, they transport no energy. They 
have, in fact, no objective existence, and cannot be detected 
by any conceivable experiment. By his special choice of 
co-ordinates, Einstein, as it were, compels these spurious waves 
to travel with the same velocity as the genuine TT waves. 
“He imposes conditions on the co-ordinates which bar out 
most of the spurious disturbances possible ; but those which 
have the velocity of light take advantage of their close resem- 
blance to the genuine waves and slip through the barrier ”’ 
(Eddington, Joc. cit.). In short, these spurious waves “are 
merely sinuosities in the co-ordinate system, and the only 
speed of propagation relevant to them is the speed of thought.” 

As a matter of fact, an investigation of the analysis shows 
that the Riemann-Christoffel tensor depends only on the 
differential coefficients of Ay, hz3, and hz, with respect to the 
argument, +, + V%,, and so the tensor depends entirely on the 
TT waves. For LT and LL waves the tensor vanishes and 
space-time is flat ; in other words, the “‘ supposed disturbance 
is an analytical fiction.” 

A remarkable conclusion which Eddington draws from the 
condition 


hoe + Msg = 0, 


satisfied by the TT waves, is the fact that these waves arise 
from very secondary disturbances in the distribution of matter, 
such as periodic stresses occurring in an infinite plate, or a 
circular motion of the plate in its own plane. A backward and 
forward motion of an infinite plate produces no waves, although 
similar behaviour of a small plate would. (They would be 
divergent and not plane.) Variations of T4,, i.e. alternate 
creation and destruction of mass, would likewise be ineffective, 
for “nature, having made no provision for the propagation 
of the corresponding disturbance, thereby automatically pre- 
vents the construction of such a source.” 

But plane waves are too artificial, and in order to illustrate 
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fully propagation of gravitational potential, divergent waves 
have to be studied. This Eddington proceeds to do, making 
use of Einstein’s solution (42), obtained above, and employing, 
of course, the restricted choice of co-ordinates. The reader is 
referred to the paper for the details, but the following broad 
conclusions can be stated briefly. 

Despite the resemblance between the equations of propaga- 
tion of the gy, and the equations in the theory of sound, simple 
isotropic spherical waves of gravitational disturbance cannot 
occur. The reason for this is the fact that if ¢ is the potential 
in the acoustic problem, 


O¢=0 
unconditionally, while in the gravitational problem 
Ogu = 0 


provided (38) is satisfied, i.e., provided the sources obey the 
conservation of energy and momentum. This bars out at once 
such sources as points where matter is alternately created and 
destroyed, from which simple spherical waves might be expected 
to arise. In cases where conservation is obeyed, as, for instance, 
in the case of a rod spinning round an axis perpendicular to 
itself (a problem first treated by Einstein), while simple spherical 
waves occur for some components of gy, they must be accom- 
panied by waves arising from a doublet source for other com- 
ponents. 

Spinning bodies produce divergent waves, and slowly lose 
energy by reason of them, a result first discovered by Einstein. 

Spurious divergent waves can exist just like spurious plane 
waves, but they cannot be readily distinguished from the 
genuine, and Weyl’s classification into three types which can 
exist independently of one another no longer holds. 

Types of TT waves which are inconsistent with the equations 


Guy = 0 


exist ; that is, other physical manifestations not of a purely 
gravitational character accompany them. For instance, we 
can have TT waves for which hy. + h33 is not zero. They can 
exist in space which, although not empty in a strict sense, is 
empty of matter. In short, energy of radiation is present, and 
propagation of light accompanies the propagation of gravitation 
in the case of these “ electromagnetic gravitational waves,” as 
Eddington calls them. 


PART AIL, 


WORLD GEOMETRY. 


CHAPTER XIV. 


ALTHOUGH the Relativity theory has been the direct product 
of physical experiment, yet the appeal to geometrical ideas 
connected with a conceptual four-dimensional world which 
cannot be grasped intuitively has been manifest throughout the 
preceding pages. While removing space and time from the 
category of the absolute, experiment lent support to the intro- 
duction of a new absolute, Minkowski’s element of separation 
between two events. The first step which Einstein took in 
framing his theory of Gravitation consisted in a generalisation 
of the mathematical expression for an element of separation. 
In this step and in his suggestion that gravitational orbits in 
any frame of reference are the projections of “ natural’ world- 
lines or “‘ geodesic tracks ’’ in space-time, he evaded a criticism 
which had been levelled at Newton’s theory by his great con- 
temporaries Huygens and Leibnitz, and which in one form or 
another has occupied the minds of scientific thinkers ever since. 
While every one had to admit the remarkable power of the 
Newtonian formula for gravitational force in summarising the 
astronomical evidence, it was felt that the idea of “‘ action at 
a distance’”’ constituted a serious breach of a principle which 
impressed itself with great effect on scientific opinion, especially 
during the nineteenth century—the principle of Continuity. 
How could two bodies separated by a finite extension in space 
affect each other’s movements ? When Maxwell had shown 
that the theories of action at a distance employed in the electro- 
static and electrokinetic theory of the early nineteenth century 
had to be replaced by the continuous action through a medium 
implied in his electromagnetic theory and explicitly expressed 
in the differential form of his equations of the field, it was only 
natural that similar ideas concerning a possible transmission of 
gravitational action by the same medium from body to body 
should impress themselves on scientifi¢ opinion with renewed 
force. One serious difficulty, however, blocked the way. It 
was generally believed on the weighty authority of Laplace 
. that such an action would have to be transmitted with an infinite 
303 
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speed in order to agree with the observed motions of the planets, 
and such a conclusion was felt to be somewhat at variance 
with the finite speed of propagation of radiation ; indeed, the 
difficulty bore some resemblance to the famous troubles which 
surrounded the question of the “‘ longitudinal wave”’ in the 
elastic-solid theories of the ether, and it was only the investiga- 
‘tions of Lorentz and Poincaré on the possibility of generalising 
_ Newton’s formula to fit the Relativity principle which revealed 
\ the fallacy in Laplace’s reasoning. 

It is clear, however, at the outset that Einstein’s theory 
involves no violation of the Continuity principle, for the simple 
reason that no gravitational force is invoked at all. In Newton’s 
theory the “natural” paths in absolute space are straight ; 
force is that which deflects bodies from these paths, and so 
gravitational force has to be postulated in addition to mechanical 
and electromagnetic forces. In Einstein’s theory there are in 
a definite frame of reference “ natural ’’ paths given by a certain 
differential equation ; “‘force”’ is that which deflects bodies 
from these paths, but it is purely mechanical or electromagnetic 
in character. The idea of continuity is satisfied because the 
“natural”? paths are one aspect of the world-lines or geodesic 
tracks which connect, element by element, two events in the 
universe separated by a finite extension in space-time. But 
this procedure involves of necessity an appeal to geometric 
ideas, which, truth to tell, does not meet with great favour in 
some quarters, and criticisms are from time to time launched 
at what is called the “‘ geometrisation of Physics.” It is to this 
feature that we must now give closer attention, and it will be 
necessary to indicate the connection which exists between 
Einstein’s theory and the famous researches in Differential 
Geometry of Gauss, Riemann, Christoffel, and Levi-Civita. 
The mind of the reader will thus be prepared for a consideration 
of the problem of boundary conditions which is engaging the 
attention of relativists at the moment and leading to rather 
startling conclusions concerning the scale of the universe, and 
he will also be enabled to grasp a further generalisation of the 
Relativity theory by Weyl, which seems destined to relegate 
electromagnetic force to the same position as that into which 
gravitational force has been thrust. 

Nothing brings home the geometrical nature of Einstein’s 
theory more forcibly perhaps than the constant recurrence of 
the phrase, “curvature of space-time.”’ It is meaningless to 
those who are not only unable to visualise such a thing (forming 
no exception among men in that respect), but are also unaware - 
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that it is a name for a definite analytical formula. It will shed 
some light on this concept and its origin if we give a brief 
summary of Gauss’ researches on curvature of surfaces and 
Riemann’s extension of them to u-dimensional manifolds. 


METRIC PROPERTIES AND CURVATURES OF SURFACES. 


_In the first place, if we choose a point O on a surface as 
origin of co-ordinates, and the tangent plane at O as the plane 
% = 0, the equation of the surface can be written in the form, 


z=ax*+ 2hey+by?+t... 


the remaining terms being of the third order at least in x and y. 

If a point P be chosen on the curve in which the plane ZOX 
cuts the surface, and a normal fo the surface be drawn at P, this 
normal does not in general cut the normal OZ. Even as P 
approaches O, the shortest distance between the normal at P 
and the normal OZ bears a finite ratio to the distance OP. 
Similar statements are in general true concerning the normal 
to the surface at a point Q on the section by ZOY. There is 
an exception to this generalrule. By rotation of the rectangular 
axes OX, OY in the plane z = 0, it is possible to find a position 
of these axes for which the coefficient of the term in xy is zero, 
and the equation of the surface can be written 


z=anrr+by+... f : eS 


It is known that in this case the ratio of the shortest distance 
between the normal at P and OZ to the distance OP approaches 
zero as P approaches O. In brief, the normal at a point on 
the section by ZOX adjacent to O meets the normal at O ina 
definite point C,, called a centre of principal curvature. Simi- 
larly an adjacent point Q on the section by ZOY gives a second 
centre C,. OP and OQ are elements at O of the two lines of 
principal curvature which cross at O. At every point on the 
surface there cross two such lines, and at every point there 
exist two principal centres. Such lines in fact form two 
orthogonal families of curves on the surface, and any finite 
number of them give a mesh configuration on the surface. 
Also, it can be proved that if R, = C,O, R, = C,O (the prin- 
cipal radii of curvature at O), then the coefficients a and 6 in 
(1) are 1/2R, and 1/2R, respectively. Now let us draw through 
P the other line of curvature at P, and through Q the other line 
of curvature at Q, these meeting in S (say) ; then OPSQ is an 
elementary rectangle on the surface whose area is 

20 
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R,R,56 805, 
where 80, = LOC,P, and 6, = LOC,Q. 


If from any point C four lines are drawn parallel to the 
normals at O, P, Q, S, these lines cut out on the surface of a 
sphere of radius R drawn with C as centre an elementary 
rectangle of area R%50,50.. Suppose R is so chosen that 
the two rectangles are equal, so that R= (R,R,), the 
surface is said to have at the point O the same curvature as the 
sphere. The “ measure of curvature”’ of the sphere is 1/R?, 
and so 1/R,R, has been named by Gauss the measure of curva- 
ture of the surface at the point O. 

Abandoning these special axes, suppose we choose any 
arbitrary Cartesian co-ordinates, and make x, y, z functions of 
two independent variables x, and x, so that 


Any definite pair of values assigned to x, and x, determines 
one point on a surface, the surface being that whose equation 
is obtained by the elimination of x, and x, from the three 
equations written. (Thus, if 


% = ACOS%, 
y = bsin x, cos 4, 
216 Sin x. Sine 


the surface is the ellipsoid 
x?/a? + y2/b? + 22/c? = 1.) 


When a definite value is assigned to ¥,, and x, varies arbi- 
trarily the point is restricted to a definite curve on the surface, 
thus 

%, = constant 


is a family of curves, and 


x, = constant 


is another family of curves, which in general mark out on the 
surface a mesh configuration. (It is not necessarily a rect- 
angular mesh, nor need these lines be lines of principal cur- 
vature.) 


As an abbreviation for the first and second differential 
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coefficients of x, y, and zg with respect to x, and %,, we shall 
employ the following convenient notation :— 


d&/d% = A; dE/d%_ = ay 
2n/d% = by; d/dX_p = dg 
08 /d%, = Cy; 06/dx%_ = Cy 
944 /D%y = Ay; 0Ay/I%y = yy = Agy = 24q/9%q; IMq/I%y = Ay 


and a similar notation in 0,,,.. ., Cg for the remainder. 

All these are, of course, functions of x, and x,, and are not 
in general constant over the surface. 

Thus if (%;, ¥2) and (x, + 6x,, x, + 6x,) are neighbouring 
points on the surface, the square of their distance apart is 


equal to 
6x? + dy? + 52? 
= ¥(a,6x, + a,8x,)? (2) 
= 2 410% 1? + B20d%—” + 281 00%15%5 
where 
fu = 2a? 
&o2 = 2a," 


812 = 44% = a1. 
Moreover, since it is easily proved that 
(byCy — byC,)8% + (Cya_q — CeAy)8y + (4,0, — @,b,)8z2 =0 


it follows that B,C, — b,Cy, Cy4_ — CoQ@y, 4b, — a,b,, are pro- 
portional to the direction cosines of the normal to the surface 
at the point (%,, ¥,). Now 


3(b1Cg — 54cy)* = Sa? 3a,? — (Sa,a,)* 
= 811822 — 812" 


eo 
where g is the determinant 
| 811 E12 
| 821 §22 


so that the direction cosines are 
(byC2 — b2¢1)/n/8, etc. 


Consider the elementary parallelogram on the surface whose 
corners are the points A, B, C, D with the co-ordinates 


(%1, %2) (%1 + 5% 1, %_) (%1 + 8%, Xo + O%s) (%y, %2 + 5%Q) 


respectively. Since 
20) * 
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AB = 4/(8119%1") 
AD = 4/(8 200%") 


and AC = / (8 118% 1? + £228% 9” + 28120% 18%), 
it follows that 
cos LBAC = £19// (811822); 


sin LBAC = ,/(g/g182). 


In consequence the area of this parallelogram is g?5x,5%., 
and therefore the area of the part of the surface bounded by 
the four lines 


and therefore 


% == My, %_ = Ng, % = Ry, Hy = hey 


is | eiaades oe .~ 3) 


the domain of integration being h,2x,2k,, h,Zx.Zhp. 

The results (2) and (3) show that the mensuration of lengths 
and area on the surface are directly dependent on the values 
of the three coefficients 211, 220, £12 

Gauss demonstrated a still more striking conclusion, viz., 
that the curvature of the surface at a point is a definite function 
of £11, Zee, Ziq and their differential coefficients with respect to 
%, and %,, and in so doing introduced the prototype for two 
dimensions of the Riemann-Christoffel tensor. 

The proof of this result is somewhat tedious and involved 
as obtained by Gauss, and so to avoid a long digression, it is 
given in an appendix. Gauss succeeded in showing that the 
measure of curvature is : 


Ryle . . 7 


where we are employing the Ry.,. symbol defined as in Chap- 
ter IX. to be 


d[II, 2]/dx%_ — dI[12, 2]/dx, + ({12, a}[12, a] — {11, a}[22, a]), 


the summation for the dummy suffix being of course only for 
two variables. 

We can use the properties of these symbols as explained in 
Chapter [X., to find the mixed tensors. Hence we can find the 
contracted tensors R,,, Roo, Ry, (the summation in the con- 
traction being, of course, only fora =1anda=2. Finally, 
we calculate the invariant R, i.e., g4R,, + g?*Ro. + 2g1*Rjp, 
which turns out to be 


— 2Ry012/8, 
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so that the curvature of the surface at the point is 
— R/2. : : . . (5) 


There is another method, due to Levi-Civita, of dealing with 
the curvature of a surface, which dispenses with the use of 
normals to the surface, but is connected with what we may call 
an attempt to carry a vector with a given direction from one 
point on the surface to another. On a Euclidean plane the 
axiom of parallelism defines for us the same direction at two 
different points. The matter is not so easy on a curved surface. 
For simplicity, assume we are dealing with a spherical surface. 
Choose two points P and Q on it, and define a direction at P 
by an element of a great circle passing through P. Can we 
attribute any meaning to “ the same direction”? at Q? What 
great circle through Q (if any) possesses an element at Q 
which is “ parallel’ to the element at P. No circle certainly 
has this property in general if “ parallel” is to be interpreted 
in accordance with three-dimensional Euclidean geometry. 
(Only if the great circle through P happened to be at right angles 
to the great circle passing through P and Q would that be 
possible.) But if we restrict our directions to be tangential to 
the sphere, we can select from all such directions at Q one which 
is uniquely related to the given direction at P, and which could 
be with convenience named as the ‘“‘same’”’ direction as that 
at P. Let us choose the great circle at Q, which makes with the 
great circle PQ an angle on the same side equal to that made 
by the great circle through P, and define the direction of its 
element at Q to be the “same”’ as the direction at P. This 
will certainly lead to no ambiguity nor to any contradiction, 
provided all directions considered are restricted to being 
tangential to the sphere. Now consider a spherical triangle 
POR, and transfer in this manner a given direction at P to the 
‘same ”’ direction at Q (by what we may call a “ parallel dis- 
placement ’’), then transfer the direction at Q to the “same” 
direction at R in the same manner along the great circle R. 
Now transfer the original direction at P to R directly along the 
great circle PR. The two directions at R do not agree, as 

-anyone will readily recognise who is aware of the elementary 
fact of spherical trigonometry that the three angles of a spherical 
triangle together make up an angle greater than two right 
angles. Of course, there is no ambiguity as to which of the 
two directions at R is really the ‘‘same”’ as that at P. By 
our definition it is clearly the latter, and what we prove is that 
successive displacements along any path do not in general 
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arrive at the ‘‘same”’ direction at another point, but only the 
direct displacement along the great circle path. This fact can 
be stated in a still more instructive fashion. After having 
carried the direction from P to R, via Q, transfer it back to P 
directly along the great circle RP. The new direction at P 
does not agree with the original, but makes with it an angle 
equal to the excess of the sum of the angles of the spherical 
triangle above two right angles, and the sense of the rotation 
from the original direction to the new is right- or left-handed, 
according as we go round the triangle in a right- or left-handed 
sense. This fact is referred to as the non-integrability of 
direction. Now it is a well-known result that the value of this 
excess is equal to the quotient of the area of the triangle by the 
square of the radius of the sphere, i.e., the product of the area 
of the triangle and Gauss’ measure of its curvature. The 
result can be easily generalised to any figure on the spherical 
surface. Indeed, it may be extended to any curved surface, 
provided we substitute the word “ geodesic ’’ for “‘ great circle ”’ 
in the above, and employ the “‘ average measure of curvature ” 
over the area bounded by the curve on the surface. In fact, 
this method can be employed to obtain the analytical result 
for Gauss’ measure obtained above. We shall not stop to 
indicate the steps involved, as we shall presently be applying 
the method of “ parallel displacement” to the more general 
case of an m-dimensional manifold. 
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The curvature of the surface of a material body is a property 
which we perceive because the space of our experience is three- 
dimensional. The result just obtained is important in that 
it demonstrates that the analytical measure of that curvature, 
however, depends on the functions which help to express the 
distance between two points on the surface when a system of 
co-ordinates or mesh configuration has been defined, i.e., on 
the inner metrical relations of points on the surface quite apart 
from the relations between these points and points not on the 
surface. We are accustomed to speak of a surface as “ plane,” 
if we do not perceive any curvature ; but it does not therefore 
follow;that the Gaussian measure of curvature of a “ plane ” 
surface vanishes. If we assume that the postulates and axioms 
of Euclid (in particular his axiom of parallels) are true for the 
surface, then it can be shown very easily that the measure of 
curvature is zero, and our actual experience of material bodies 


PROPERTIES OF A MANIFOLD 311 


is such that these axioms and postulates, if not absolutely true, 
represent a very close approximation to the truth. But if we 
replace this set of axioms by another set which is assumed to 
be valid for a plane surface, the measure of curvature in Gauss’ 
sense may not be zero—it is not zero, for example, in Lobat- 
chewski’s Geometry, which is founded on Euclid’s postulates 
and axioms with the exception that Euclid’s axiom of parallels 
is replaced by the hypothesis that through a given point in a 
plane there can be drawn not one but an infinity of lines which 
do not meet a given line in the plane, no matter how far produced 
in either direction. In this Geometry the metrical relations 
between the points on a plane are of such a nature that the 
function R in (5) is positive, and so the measure of curvature 
is negative. In fact, as those acquainted with non-Euclidean 
Geometries will know, the Euclidean geometry of figures con- 
structed with geodesic lines on the anticlastic surface, known 
as the pseudo-sphere, is identical with the Lobatchewskian 
geometry of rectilinear figures on a plane. 

It is only natural for the inquiring mind which has been 
put in possession of these facts to speculate on the possibility 
of a curvature in our three-dimensional space. Intuitive 
apprehension of such a curvature is apparently beyond us in 
the present state of human development, but an intellectual 
grasp of its possible existence is within the power of anyone 
who is capable of realising that the inner metrical relations of 
the space of our experience may not be just precisely those 
which are deduced in Euclid’s Geometry. Nor need we stop 
at a curvature in three dimensions ; speculation soon extends 
to the possibility of importing similar ideas into the treatment 
of any manifold, an m-dimensional manifold being a class of 
entities each one of which is uniquely defined by a group of ” 
numbers. A curve, for example, is a one-dimensional manifold 
in which the entities are points ; a surface is a two-dimensional 
manifold of which the component entities may be taken as 
points or curves; a volume is a three-dimensional manifold. 
The simple tones are a two-dimensional manifold in which the 
numbers refer to pitch and intensity. The class of entities 
which we call colours form a three-dimensional manifold if we 
accept the theory that any colour can be imitated by blending 
three primary colours in suitable proportions. If we consider 
human beings in five limited characteristics—height, span of 
arms, girth, weight, age—we would be dealing with a five- 
dimensional manifold. The latter would be a “ discrete” 

manifold in contradistinction to the others mentioned where 
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there is continuity between the members of the class. Now 
we can always represent any manifold of dimensions not higher 
than three by spatial diagrams, each member of the class 
corresponding to a point in a curve, a surface or a volume, as the 
case may be. Such representation is not visually possible to 
us beyond three dimensions ; nevertheless, the mathematical 
analysis of certain manifolds which are of importance in Physical 
Science is of such a nature that it develops formule and 
equations in four or more variables precisely analogous to 
formulz and equations in three variables occurring in analytical 
geometry. The best example of this is the representation of 
the motion of a dynamical system with degrees of freedom 
by a curve in 2m-dimensional space, in which the geometrical 
co-ordinates are the Lagrange generalised co-ordinates of 
position and momentum of the system. 

A little reflection, however, shows that in the representation 
of a one-, two-, or three-dimensional manifold by a space 
diagram, it may happen that some important property of space 
itself may be of no importance whatever in the representation, 
i.e., there may be no physical property of the manifold corre- 
sponding to this property of space. Thus in the representation 
of tones or colours in the manner referred to above no physical 
significance can be attached to the distance between two points 
on the diagrams, nor to the direction of the line joining them. 
The manifolds are, in fact, neither ‘“‘ metric”’ nor “ affine” (to 
employ the phraseology of Levi-Civita and Weyl). Now we 
have seen that in the Relativity treatment of the space-time 
continuum, a quadratic function of the differentials of the 
co-ordinates is of great and fundamental importance, being the 
square, in fact, of the invariant “separation,” and is the 
analogue of the quadratic function in the geometry of surfaces 
which represents the square of the invariant distance. So 
space-time is a metric four-dimensional manifold. Direction 
can also be postulated as a property of it; hence it is affine. 
Let us therefore consider any affine and metric m-dimensional 
manifold, i.e., a manifold for which there exist physical pro- 
perties whose magnitudes are directly related to a step or 
displacement from one member of the manifold to a neigh- 
bouring member, and to some quadratic function of the 
differences of the co-ordinates of the two members, which 
maintains an invariant value for all modes of co-ordinating 
the members. If now we suppose such a manifold represented 
in an ”-dimensional space, we can assume that such a step is 
represented by an elementary displacement in the space, and 
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that the physical property corresponding to the quadratic 
function is represented by the distance between two repre- 
sentative points. If this quadratic function has constant 
coefficients the space is ‘‘ homaloidal ” or flat ; if the coefficients 
are functions of the co-ordinates, and it is impossible to transform 
to a co-ordinate system in which the co-efficients are constants, the 
space is “ non-homaloidal,” it is warped or curved in a space 
of higher dimensions, and we can arrive at an analytical expres- 
sion for its curvature in the manner suggested above. Consider 
an elementary displacement (A1, A?,..., A”) at a point 
(%1, %2, . . ., %,). There issome displacement (B!, B, . . ., B”) 
at the point (7, Vs, . . ., Ya) Which is equal to the former, i.e., 
has the same direction and magnitude ; but it is not in general 
one given by B\= A*. For example, if it happened to be so 
for one system of co-ordinates, it would not be so for others. 
Thus, transforming to accented co-ordinates, we would have 


A’* = Ayq(x)A*) 
Bix = ayq(y) BS 


and since @, has not in general the same value at (¥%,... %n) 
as at (Vy, ... yn), it follows that A’*=+ B’A, even though 
At = Br, It is clear, therefore, that a displacement at the 
point yy, “ parallel to”’ a displacement at %), is not obtained 
as readily as one might suppose when the space is curved. 
Even if the space were flat equal elementary changes in the 
co-ordinates do not necessarily give equal displacements at 
different points as consideration of polar co-ordinates will 
promptly show. 

Yet there is no meaning in saying that a manifold is affine 
unless there is a one-one correspondence between the directions 
at a point in the representative space and the directions at any 
other point. We have seen how such a correspondence could 
be set up geometrically in a two-dimensional curved space, and 
we can make the attempt by analysis in spaces of higher 
dimensions. Thus if P and Q are two neighbouring points 
x, and x, + 6x), and if the elementary displacement A* at P 
is equal to the elementary displacement B* at Q, then 


B= pA, eo Se (6) 


(the summation is from a = I toa = n) 


where the summation for a and f extends from I to m for each, 
so that there are really »? terms represented by the second term 
on the right-hand side, and where fy,” are 1? coefficients, which 
are definite functions of the co-ordinates, whose form depends 
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on the nature of the manifold or representative space, and will 
presently be discovered. The reader must be warned that 
fry” is not a tensor (mixed of the third order), and so it is not 
written with a capital letter; but it is convenient to use a 
notation in which two indexes in the covariant place mechani- 
cally cancel, as it were, two in the contravariant place leaving 
one in the contravariant place in agreement with the other 
terms.* It might appear at first sight wrong to assert that 
fa” is not a mixed tensor, for in (6) is not fag*A%dxg a vector, 
being the difference of two vectors? This line of reasoning 
overlooks the fact, however, that A* and B® are estimated at 
different points, and so in transforming to another system of 
co-ordinates different values of a, would be involved in each 
case, and fug*A%5xg would not transform as a vector. 

We can justify (6), since the second term of the right-hand 
side is the most general linear expression for the changes in 
the components corresponding to displacement of the vector 
‘“‘ without absolute change.’’ The justification can be made 
more convincing by employing a postulate of Weyl’s that there 
exists at each point a co-ordinate system which is “ geodesic ”’ 
for the immediate neighbourhood, i.e., one in which the com- 
ponents of a vector at the point do not vary when the vector 
is displaced without change to a neighbouring point. Let &) 
be this system, and let X* represent the vector at &, in this 
system. Let the transformation equations from the x co-ordi- 
nates to the £ co-ordinates be given by 


E) = $)a9%a 
and 8x 1= 1 


ad\’Sa> 


where ¢,, and ya, are functions of the co-ordinates such that 


brotha = TitA= p 


=oifAtuy. 
(See Theorem I. of Chapter VIII.) 
Hence 
AX = h,,X7 (value at the point P) 
and BN =x: Op AN Sn 5 en) 
so BA — AX = Ipyald%Xp . XV8XB 


= Pyadpyr/dXg : Adxg 


* Of course, the 8 in $%g is actually to be considered in the contra- 
variant position ; in fact, some writers use for the sake of uniformity 
the notation (%)4, (8%) instead of x), d¥,. 
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which agrees with (6) if we put 


Sa” = — PaudPaa/d%y- 


If we take A* to represent an elementary displacement 
PR = dx), which on parallel displacement to Q becomes QS, 
then the co-ordinates of S differ from those of P by 


dx, + 8x, — fagrdx,8xe. 


If we now consider the displacement PQ displaced parallel 
to itself along PR into the position RS’, then the co-ordinates 
of S’ differ from those of P by 


dx, + dx, — faprOxdxp, 
which can be written by a modification of dummy indices, 
dx, + dx, — fpa*dxgdxq. 


It is a necessary condition for an affine geometry that S 
and S’ should agree to the first order, i.e., that the space should 
be flat to this order. This requires the condition 


Su®=fuX . . . . 9 


which can as a matter of fact be deduced from the expression 
given above for /,,%. 

The change in the components of a covariant vector which 
has suffered a parallel displacement can now be obtained 
readily. We can define a parallel displacement in this case by 
the condition that if A, is a covariant vector, and A® any 
arbitrary contravariant vector, then the invariant product 
A,A®* is to have the same value after a parallel displacement 
of both vectors. Hence if By is the displaced covariant vector 
at the point x, + 6x), we have 


B,(AY — fagvA*Sxp) = A,A? 
or (By a Ay — fyg*B,8%p) AY = 0, 


interchanging dummy suffixes in the last term. 
Hence, since A? is arbitrary, 


By, =Ay ee eee - 8) 
or By = A) + frat 5% 


since quantities of the second order are being neglected. 
It is of great importance to be quite clear about these 
“spurious” changes in A* and A), which accompany “ no 


absolute change” in the vectors. To repeat, if the scheme of 
transformation from unaccented to accented co-ordinates is 


AX == eS 
A)’ a Wee Ne 
then 
BUS — f'agA'*dxg" som (4a ae 4 )q) (As — fey*A®dx,) 
and a + f'xptAa dx," = (Oya a 8d)a) (Ag + tee pox), 


where aay + ddan, bry + dda, are the values of the transforma- 
tion coefficients at *, + 6%). 

Now let A* be any contravariant vector function which 
has the value A* + 8A’ at the neighbouring point x, + 5%). 
Then since the difference between two vectors at the same point 
is a vector, it follows that 


OAA + FaprAa *5%g 


is a contravariant vector. But 6x, is a contravariant vector. 
Therefore 


DAN Ke tr fat A® : - (9) 


are the components of a mixed tensor of the second order, 
covariant in uw, contravariant in 4. 
Similarly it can be shown that 


0A 0%, ata : : = (xO) 


is a covariant tensor of the second order. 

Comparison of these expressions with the covariant deriva- 
tives obtained in Chapter IX. suggests that fay” = {Au, v}, and 
we shall verify this presently, although we shall see later that 
the verification implies a certain restriction on our geometry. 

An important particular case arises when the vector to be 
displaced is dx,/ds, ds being the invariant distance between 
two neighbouring points, so that the ‘“‘ spurious’ change in 
the components is 

— fupdxgax,/ds. 


If this vector be displaced not merely parallel to itself, but 
also “ along ”’ itself, we obtain successive elements of a geodesic 
line, whose equation is therefore 


0(dx)/ds) = — fapOx~dx,/ds \ 


or d®x/ds? + figrdx,{ds . dxajds = 0 Gz) 


Having defined the meaning of a parallel displacement more 
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closely by means of these formule, we shall investigate the bear- 
ing of them on the metric properties of the manifold. The 
distance between two contiguous points of the representative 
n-dimensional space is the square root of a quadratic function 


&apoXOXp, 


and the magnitude of a contravariant vector A is the square 
root of 


£agA a4 B. 


If this vector experiences a parallel displacement, it does not 
undergo any change in magnitude, hence 


(2gap/9%e- A8A® — gue fye8AY — gap A®fy<°A7)Sxe = 0, 
or on suitably modifying the dummy suffixes 
(28ap/d%« — faca — facg)A*A® dx. = 0, 
where Pano = Svafan®, and so also frpy = 2"* fra 
Since the variations 6x, are arbitrary, 
frou + fur = 2Bry/ 0% 


Te ae dag = uy OX 


ae + fruv — Ova] O%Xp- 


Hence, as fryy = fury 
Siwy = £(dgar/d%p fs Wuy/ d%r =< Wap! dx), 
= [Ap, v] 


and in consequence 
fru” = r[Aw, a] = fu, a}. - (12) 


In addition to laying down an expression for the distance 
between two points, we also shall require to deduce an expres- 
sion for the area of an element of a two-dimensional surface in 
order to make progress towards an analytical expression for 
‘* curvature.” 


Let P be a point whose co-ordinates are (%1, %2, . . ., %n); 
Q a point whose co-ordinates are (¥; + 4%, %, + d%,.. ., 
Xn + d%,); Ra point (%, + 8%, %2 + d%, ..., %, + O%n); and 
Sa point (% + dx, + 844, %_ + d%_ + 8%, « . ., % + din + Oxy). 
We can refer to the point (%,, %., ..., xy + CL Bee Boe ae Xn) 
as the projection of Q on the “local” A-axis, and the point 
(X%4, Xa, -- +» %, + ay, © «, Hp + ax, .~ ., %n) as the pro- 


jection of Q on the “ local” Au-co-ordinate surface. 


318 RELATIVITY 


Now from the result (3), proved earlier, it is clear that the 
area of the elementary parallelogram whose corners are P and 
the projections of Q, R and S on the local Au-co-ordinate surface 


is 
Ap, Au |t(dx dx, — dx,8x)) . : - +(0a} 


where we write | «A, pv| as a convenient symbol for the deter- 
minant 

Sky xv 
Sau Sar 


This result can be generalised into the following theorem. 
The area of the elementary parallelogram PQRS is the square 
root of the sum of the m4 terms 


Lap, y3|dSeedS7® | S(t) 


where dS’ is the anti-symmetrical tensor dx 5x, — dx,6x,, which 
corresponds to (but is not in general equal to) an element of 
area in the local Au-surface. The expression (14) is of course a 
quadratic expression in the variables dS“, whose coefficients 
are the determinants | «A, pv|. 

To justify this result, we observe that as dS* is a contra- 
variant tensor of the second order, and dS*dS” one of the 
fourth order, therefore 


Baye psdSBdSv5 
is an invariant. 
So also is 
LadLpyASBdS®y 
an invariant. 
Since dS¥8 = — dSy, it follows that the expression (14) is 


an invariant. Its value is therefore unchanged if we transiorm 
to another set of co-ordinates, and in particular to one in which 
PORS is the local 12-plane. In these (accented) co-ordinates 
all but four of the terms in the transformed expression (14) 
vanish. These terms correspond to 


a—y—T, pO 2 
obo Nees py = 
a=y=2, p=6=71 
a=0 = 2. So Es a. 


and so the expression of (14) is equal to 
| 12, 12/aS “ase, 
which is the square of the area of PQRS. 
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CURVATURE OF A MANIFOLD. 


We are now in a position to deal with the question of 
curvature in a manifold by the method suggested in connection 
with two-dimensional surfaces, viz., by the change produced 
in the direction of a vector by parallel displacement round a 
closed curve in the representative space. 

Let a vector A* be carried by parallel displacement round 
a small complete circuit which embraces an area whose projec- 
tions on the local co-ordinate planes correspond to the anti- 
symmetrical tensor S“, Then if AA* represents the change 
in the x-component of the vector after circulating round the 
circuit, 

AAk = /JA*/dxg . dxg 


= — /fg.*Adxg by (6). 
Taking the terms under the integral in pairs and applying 
Stoke’s Theorem we obtain 
AAK = E/ffd(fyaA®)/d%8 — d( faa A®) /24)dSPr. 
The integrand is by a further application of (6) equal to 
A*(dfya"/dx8 — Of pak/d%y) — (fya®fpe* — faa'fye*)AS, 
which by a suitable modification of dummy suffixes is equal to 


A*(dfyak/dx8 — Ypat/I%y + fratfae — fea‘fye*). 


So that 
AA® = 3 //Raip,tAdShy , test tS 


Ryu = YK /d4u — ay |d%y + frvfap— frurfov®. (16) 


Since frp” = fAu, v}, we see that Ray is the Riemann- 
Christoffel mixed tensor which we met in Chapter IX., and 
equation (15) expresses its significance in the geometry ot multi- 
dimensional space. We can bring it into closer relationship 
with Gauss’ result for a two-dimensional continuum as follows. 

In the first place, we make use of the result that 


£apA*BB 


is the (invariant) geometric product (A . B) of the vectors A 
and B. This is a generalisation of the fact that g.gA“A8 is 
the square of the invariant magnitude of the vector A. Then 


_by (15) 


where 
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(AA . B) = $RagygneA*BedS8y, 


where we consider the circuit as infinitesimal and drop the 
integral signs. Hence 


(AA . B) = }RapyeA*BedSey. 


Now let A and B be the vectors dx, and 5x) represented by 
the sides PQ and PR of the elementary circuital parallelogram 
considered above. Now suppose that vector PQ, when trans- 
ferred round the parallelogram, comes into the position PQ’, 
then AA is vector QQ’, and the magnitude of PQ is equal to 
the magnitude of PQ’. Thus in the limit QQ’ is perpendicular 
to PQ, and so the geometric product of AA and B is equal 
to the area of the parallelogram PQRS multiplied by the ratio 
of QQ’ to PQ, ie., by the angle QPQ’, which measures the 
change in the direction of the vector A after its transference 
round the circuit. 

Hence circuit area X vector deviation 


= $Ragyed% 9% 457. 
It is also equal to 
ER cpyad% bX qd SAY 
=—Cl £Rapycd%ghX ASA. 
Therefore it is equal to 
tRapyAS“dS8y 
or ER ByeadS*BdS1¢ 
(modifying the dummy indices). 
Hence 
vector deviation iy tR ByeadS*hASV€ 
circuit area (circuit area)? 
— ReyeadS*BdS7 
| aB, ye |dS*8dSv¢ 
aes __ (aBye)dShdSye 


[aB, vel dSeBd Sve (20) 
in Riemann’s notation for his tensor. 

This result is called by Riemann the curvature of the 
n-dimensional space for the given elementary two-dimensional 
circuit. It is, of course, an invariant and independent of any 
choice of co-ordinates ; it is dependent upon the ratios of the 
dS, that is, on the orientation of the circuit at the point. If 
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it so happens that each component of the covariant curvature 
tensor («Auv) or Ray bears the same ratio to the corresponding 
determinant |«A, uv| as every other, then the curvature is 
independent of the orientation and is uniform. 

This result is clearly a generalisation of Gauss’ result for 
the curvature of a two-dimensional continuum. 

The link between the Einstein Theory of Gravitation and 
Riemann’s geometrical researches is now obvious. Einstein, 
in adopting a general quadratic expression for the value of the 
square of the invariant separation between two physical events, 
was naturally led to use the same mathematical analysis as 
had been developed for the treatment of a multi-dimensional 
continuum. In space-time where gravitation is absent, or at 
most entirely removable, every component of Ray or Ray is 
zero, Or in geometrical language such a space-time is a flat or 
“homaloidal ” four-dimensional continuum. In it any vector 
carried round any closed circuit by a parallel displacement 
would return undeviated from its original direction. In actual 
space-time, except at a great distance from all matter, this is 
not so; there is a curvature. A vector will not return un- 
deviated ; yet the curvature is not arbitrary. There is a cer- 
tain limitation on it expressed by 


Ray (or Gay) = 0. 


Taking equation (15), it is not difficult to interpret this 
geometrically in the following manner. Suppose we co-ordinate 
the points of an m-dimensional continuum in any way ; consider 
the point P and neighbouring points P,, Ps, . . ., Px, so that 
P, is on the local A-co-ordinate axis through P, and so that the 
x -co-ordinate of P, differs from that of P by unity, the other 
co-ordinates of P) being, of course, the sameas that of P. With 
P and any pair of these points P, and P,, we can construct 
an elementary parallelogram ; we shall call it a Au-parallelo- 
gram. Now carry a vector at P originally directed along the 
d-axis around the pr parallelogram ; on returning to P it will 
in general be deviated and have a component along the I-axis, 
which was of course originally zero. Next carry the vector 
round the 2 parallelogram and obtain a component along the 
2-axis. Repeat this operation for all the parallelograms pI, 
U2, 43, .. + pm. Ry, is proportional ‘to the sum of the com- 
ponents thus obtained. So if Einstein’s law is true for the 
continuum, the sum of these components is zero. (Each one 
of them would, of course, be zero in a homaloidal continuum.) 
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In space-time, there would only be two components to add, for 
there are only four parallelograms for each Aw, one of which 
py. yields no individual change, while the A-parallelogram also 
gives no individual component along the A-axis, since Ray” 
(not summed) is identically zero. So that if we carry a vector 
directed along the 1-axis round the 23-parallelogram, and obtain 
the 3-component after the change, it is equal in value and of 
opposite sign to the 4-component of the vector after it has been 
carried around the 24-parallelogram. 

The deviation of a vector may be presented in physical 
terms of space and time separately thus. Two particles are 
projected with equal velocities and directions from two neigh- 
bouring points. In a non-gravitational field the line joining 
them remains parallel to itself in the Euclidean sense. In a 
gravitational field it does not do so in general. 

There is also a method of treating the curvature of an 
n-dimensional continuum which resembles that first used in 
the early part of this chapter for a surface, and which brings 
out forcibly the significance of the invariant G or the “ scalar 
of curvature.” 

Consider the points of the continuum to be curved in a 
homaloidal (mz + 1)-dimensional continuum ; refer it to axes 
so chosen that the equation of the n-dimensional continuum is 


22 = kyX\7 + kox.? + . . . + RnX,? + higher powers. 


This means that the origin is a point of the v-continuum, 
and the axis of zis normal to it, while the axes of %), %5, . . ., Xn 
are tangential to it and parallel to the lines of principal curva- 
TUTE. 


Then 
Os? = 622.4 6%,7 + dx,2? + .. . + 8%, 

== (I + Ay2,7)8%,? + . . . 4+ 2h hax x,8%,0%, +. 2 | 
Hence 


Bie el 
Bru — RyRyX Xp. 


On working out the Christoffel indices and going to the 
limit at the origin, it can be shown that 


= — Rh Rey. 
Me 
For space-time 


G = — 2(kyhy + hykg + Ryka + Baby + hoky + hgh.) 
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The k) are, of course, reciprocals of the principal radii of 
curvature. ’ 

In a two-dimensional continuum curved in three dimensions, 
k,k, is the Gauss measure of curvature. In space-time curved 
in five dimensions we have the sum of six similar terms as a 
measure of the curvature invariant or scalar of curvature. 
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In carrying through the proof of equation (4) of the chapter 
by the method of Gauss, it will be convenient to introduce new 
symbols h,,, 22, hy, to represent the three determinants : 


Aso Don Cas 
ay ny CR 
dy 0s. 4 


Ayg Oye Cy 
Ge Owes 
a, be Cy 


Ay, Oy Ci 
a, by Cy 
a, dy Ce 


respectively, and /, m, n to represent the three direction cosines 
(byC2 — beC1)/./g, ete. 

Let 7 be a principal radius of curvature at a point on the 
surface whose Cartesian co-ordinates are (x, y, 2) ; the co-ordi- 
nates of the corresponding centre of curvature are (x + /, 
y+mr,z+nr). Butif (x + dx, y + dy, z + 62) is an adjacent 
point on the corresponding line of curvature through (x, y, 2), 
the co-ordinates of the centre are also (x + dx + (1 + 8dr, etc.), 
and so 


6x + 7vdl =O 
by + 75m = 0 
6z + rén = 0, 


or 
A,0X, + a,6%,+ rdl = 2) 
COX, + C:d%_ + rdn = o} 
Multiplying these by a), 0,, c, respectively, and adding, we 
obtain 


£119%4 Ste & 120% + 7% (a0) == (0), 
Say) 


hence x(a, 


Now 

O, 

— (da) 

— 8%, 5'(a,,1) — 8%.3(a,) 
— (14,5%, + hy5%.2)/,/g. 


So writing p for 7/,/g we obtain 


Nea 


an he 


(ire — 811)8%1 + (Ayep — &12)9%_ = 0. - (2) 
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Similarly multiplication of equations (1) by a», bz, cz respec- 
tively and addition yield 


(12 — 812)8%1 + (hae p — Bas)d%_ = 0. - (3) 

The elimination of 6x, and 8x, from (2) and (3) gives us a 

quadratic equation in p whose roots are 7,/,/g and 74//g, 7, 

and r, being the two principal radii of curvature at (x1, %,). 
As this quadratic equation is 


(Ayton — hy9")p? — (Ayes + hoo8i1 — 2) 2212)p + g = 0, 
it follows that 
7472/8 = g/(Ayhos — hy), 


and thus Gauss’ measure of curvature, viz., I/7,7. is equal to 


(Ayo — hy»*)/g?. . . ~~ (4) 


To reduce this to the form quoted in (4) of the chapter, we 
first observe that by the rule for multiplication of determinants, 


hyo, = | BAy1422 3444, FAA, 
ZAy4o2 2A Baya, 
YESS OY Bey ee 


By differentiating the relations 
Bay? = Sir 24d, = 2, Vg? = Boo 


with respect to x, and %9, it can be shown that this determinant 
is equal to 


III, 2]/dv%, — BAQd04,/d% 5 22.0 " (22,2) 
hese! ‘561 812 
[1I, 2] 812 E22 


In a similar way /,,? can be expressed as a determinant and 
shown to be equal to 


d[12, 2]/d%, — Bagday0/d%4 [2a (x2, 2] 
[12, I] 811 812 
[12, 2] £12 §22 


Bearing in mind that d@,,/d%_ = 0@12/0%,, we see that 


Nyon — hyo? = g(d[II, 2]0%_, — d[12, 2]d%)) 
+ (gie[11, 2] — Zee[ II, 1])[22, 1] 
+ (€1e[11, 1] — &1:[11, 2])[22, 2] 
— (£10[12, 2] — &eel12, I])[12, 1] 
— (g12[12, 1] — 811/12, 2])[12, 2]. 
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Since 
git = goole, 2 = e1i/e, g = — £12/8, 


we find that 
hyyhoo gl hy,” 


is equal to the product of g and the expression 
III, 2]/dxv_ — d[12, 2]/dx, + {12, a}[12, a] — {11, a}[22, a] 
(summed for a = I and 2). 


But this expression is R,.;2, and thus the curvature of the 


surface is 
Rjy919/8. 


CHAPTER XV. 


THE CURVATURE OF SPACE-TIME. 


In the early days of the Relativity Theory a number of objec- 
tions were urged against it which were based on the fallacious 
assumption that the restricted theory asserted the irrelevance 
of rotation in an absolute space in the same way as it asserted 
the irrelevance of uniform motion. Such a statement would 
clearly have been an unjustifiable interpretation of the postu- 
lates of the restricted principle. Indeed, it was known that in 
so far as mechanical effects are concerned, the invariance of 
the Newtonian laws for a change to a frame of reference in 
uniform tvanslatory motion relative to the original is not 
accompanied by a similar invariance for a change to a frame 
in uniform rotatory motion relative to the original. And as 
far as optical effects are concerned, the experiments of Sagnac 
have left no doubt of the important fact that rotation cannot 
be so lightly dismissed from consideration as translation. In 
Newtonian theory acceleration and rotation have a character 
apparently more absolute than uniform translation. The 
generalised theory of Relativity has robbed simple acceleration 
of this absolute character; the change in frames is signalised 
by a different set of g,,-potentials, and all gravitational fields are 
assumed to be relative in character. But rotation still appears 
to retain something of the absolute about it. Consider, for 
example, the relative rotation of the stars and the earth. From 
a purely kinematic standpoint it is, of course, immaterial 
whether we speak of a rotation of the stars around the earth 
or of the earth relative to stars. Dynamically, however, we 
cannot regard the matter as satisfactorily settled if we consider 
the alternatives equally true in any fundamental sense. We 
find, to quote the famous experiment of Foucault, that the 
plane of vibration of a pendulum on the earth’s surface rotates 
slowly around the vertical in a manner which suggests that the 
plane of vibration is maintaining an unchanged orientation in 
an absolute space in which the earth is rotating. Experiments 
with as accurately balanced a gyroscope as can be obtained 
also suggest that the axes of a perfectly supported gyroscope 
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would preserve an absolute direction which would, of course, 
be changing relative to directions marked on the earth’s surface. 
Then there is the phenomenon of the protuberance at the earth’s 
equator, accompanied by the flattening at its poles. Moreover 
the rate of rotation calculated from experiments with the 
gyroscope and pendulum is very exactly equal to the rate of 
relative rotation of the earth and stars as directly observed, 
so that even were the stars invisible to us we could still detect 
the earth’s rotation relative to something. This would suggest 
that there is an absolute space, and while uniform motion of 
translation in the space produces no discernible change in any 
physical experiment which would permit of a measurement of 
the speed of such motion to be made, motion of rotation does 
affect the course of such experiments and thereby yields a 
measure of its actual magnitude. 

The natural reply to this is to point out that the transforma- 
tion of co-ordinates from one trame of reference to another in 
rotation relative to it leads to a modification of the potentials 
which would certainly produce a type of motion in the second 
frame differing from that which occurs relative to the first. 
For instance, in the so-called “‘ inertial frame,’ in which the 
“fixed” stars are not really fixed, but still have comparatively 
slow movements, the matter tensor 7,, would proclaim its 
existence most decidedly by the component 7 ,, on account of 
the smallness of the velocities of matter in the frame compared 
to the speed of light. By contrast, in a frame turning with the 
earth, the speed of these same stars would be relatively enor- 
mous, and the whole character of the matter tensor would 
be altered.* These very great velocities, even for matter so 
distant, might produce, therefore, gravitational effects respon- 
sible for the behaviour of Foucault’s pendulum and the gyro- 
scope. Indeed, many years before the appearance of the 
Relativity Theory, Mach, in his Principles of Mechanics, had 
forsaken the philosophy of an absolute space and attempted to 
present these occurrences from a standpoint somewhat similar 
to that just outlined. 

But there still remains an objection which may be urged 
against the relativist. Doubtless it is true that the trans- 
formation of co-ordinates involved in introducing a rotating 
frame of reference makes no difference in the form of the 


* The author has had on various occasions to point out to the 
perplexed that the fact that velocity of matter cannot surpass a certain 
invariant velocity (that of light) is a deduction of the restricted theory, 
and must not be transported without qualification into the general. 
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differential equations of motion, but it does make a decided 
difference in the limiting values of the g,, as events are removed 
further and further from the neighbourhood of matter, i.e., 
there is a change in the boundary conditions. 

In Newtonian theory the values of the potential function 
are determined not only by Laplace’s and Poisson’s equations, 
but also by the condition that they vanish at infinity in any 
frame of reference. Now it requires no great trouble to see 
that no similar statement is possible in Einstein’s theory as far 
as it has been developed in the previous chapters. The equa- 
tions of gravitation are not linear in the g,,, and so the fields 
of a number of particles are not strictly additive ; yet as any 
natural gravitational field disturbs the values of the potentials 
from the values : 


—I re) ) 5 | 
Oo —I O oO 
: , ED 
oO o —I 4 
fe) re) o +1 


by very small amounts, it is not seriously inaccurate to regard 
the solution for a group of bodies as the sum of solutions for 
the individual bodies. So that if V is the value of the New- 
tonian potential for a number of bodies in comparatively slow 
motion in a frame of reference, the values for the g,, in this 
frame differ from the following scheme by negligible amounts. 


— (I+ 2V) 0) 0) 0) 
o —(1+2V) 0) 0) (2) 
fe) O -— (1+ 2V) 0) ‘ 
fe) fe) (e) I— 2V 


At great distances from all matter it is clear that the scheme 

(2) approaches the scheme (1). 
Suppose now we transform to a frame accelerated with 
respect to the former, so that we have a transtormation such as 


De Nea Xa, 
the other co-ordinates remaining unchanged, it is quite easy to 
verify that the values of g,, are transformed in such a manner 


that as x, approaches the values of the g,, approach the 


scheme. , 
—I O O 


(o.@) 
o —I to) ce) 
O o —I O (3) 
Oo (@) oO oe 2 
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a quite different set of boundary values to (1). Similarly, if 
we change to a frame in rotation relative to the first, say, by 
a transformation such as 


$= tat, 


we once more find infinite values appearing in the limits of 
certain of the gyy. 

It is therefore not an unreasonable hypothesis if anyone 
chose to assert that although we cannot single out any frame 
of reference as privileged by means of the differential equations, 
we could conclude that frames in which the boundary con- 
ditions correspond to the simple scheme of the restricted 
theory, viz., (I), have an element of absoluteness about them 
which is denied to others. As to how closely actual values of 
the g,, do approach the scheme (I) in space accessible to 
observation, there appears to be little doubt. De Sitter asserts 
that ‘‘ within the solar system the limits of uncertainty are 
very narrow: say the eighth decimal place. As we get further 
away in space, or in time, or in both, the limits become wider : 
at a distance of a million light years we can perhaps only guaran- 
tee the second decimal place. How the g,, are in those portions 
of space and time to which our observations have not yet 
penetrated, we do not know, and how they are at infinity of 
space or of time we shall never know.’ * These statements 
are based on the slight deviation of the spectral lines of even 
the most distant observed bodies, such as the spiral nebule, 
from the positions for terrestrial sources, and on the smallness 
of the relative velocities of the stars. 

There appears, therefore, at first sight something absolute 
about the scheme of boundary values given in (1). In Rela- 
tivity Theory there is strictly no distinction between gravitation 
and inertia. They are both fused in the g,,, at all events so 
long as we regard the differential equations alone. But the 
remarks just made show that the boundary conditions would 
seem to single out those parts of the g,, written in (I) or (3) as 
distinct from the remaining parts, which can be directly traced 
to the effect of known matter, so that it might be legitimate on 
that account to refer to the latter parts as ‘“‘ gravitation ’’ and 
the former as “‘ inertia.’’ But if we transform to other axes, we 
have seen that while the deviations of the g,, from the values 
at infinity can still be traced to known matter and called 
“ gravitation,” the “inertia”? has quite different values to 
those occurring in the first frame. 


* “ Monthly Notices,’’ Nov. 1917. 


~ at ama ttn ae, emaiyermeahenctin, eee 
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Now to the thorough-going relativist who asserts that 
Relativity has a philosophic foundation and justification apart 
from the teachings of an experimental science, this is decidedly 
awkward ; but it is not the business of the upholders of the 
physical principle of Relativity to defend a particular philo- 
sophic standpoint. Nevertheless, in the nature of things they 
must concern themselves with the problem thus raised and 
consider fairly if further investigation can yield any support to 
the postulate that there is a “ Relativity of inertia,’ as well 
as of space, time, electromagnetism, and gravitation. 

Now if such a contention is to be supported, very little 
examination is required to see that the only set of boundary 
values at infinity for the g,, which would be invariant for all 
transformations are 


(4) 


Oro oO 
OV OrOrS 
SP OHyore) 
2) @) jens) 


Thus, the whole of the g,, would be deviations from the 
values at infinity, and the “inertia” as well as gravitation 
would be due to the effects of matter. Such a conclusion would 
be entirely in agreement with the standpoint of Mach and his 
followers, who assert that if all matter were destroyed with the 
exception of one material particle, then this particle would not 
possess inertia, and, as will appear later, if in supporting this 
suggestion all matter known to us—stars, nebule, clusters, 
etc.—prove insufficient, they assume the existence of still 
more matter hitherto unobserved. Before proceeding to inves- 
tigate the possible validity of the scheme (4), however, the 
reader must carefully note that the introduction of this scheme 
involves a postulate. De Sitter calls it the ‘“‘ mathematical 
postulate of the Relativity of inertia.” 

We have to consider how it is possible to obtain values of 
the g,,-potentials which will, in the part of the world accessible 
to observation, preserve values differing so little from those as 
obtained hitherto that the experimental tests are still satisfied, 
and yet at greater distances will gradually approach the scheme 
(4) or any other scheme which might prove to be an invariant 
set of boundary values for defined types of transformation. 

It appears, on investigation, that such values are obtainable 
by a slight modification of the law of gravitation, which would 


become 
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Gur — Fw(G — 2a) = — 8a yy . 

or Cie aeatre 1e)} ; (5) 
from which is deduced 

G — 4 = 8neT. . ; . (6) 


As will be shown presently, these equations yield values of 
£uy Which satisfy the suggested boundary conditions at enormous 
distances, provided the constant A is chosen so small that the 


previous law 
Guy em ae 8ak(Tyy ram 3€uvl) 


is practically obeyed at distances of a smaller order of magni- 
tude. But, in order not to interrupt the more general discus- 
sion of the point involved, we shall defer the proof for a little. 
One very striking conclusion will make its appearance, viz., that 
the three-dimensional world is not infinite, but is finite in size 
though unbounded, just as a two-dimensional surface like that 
of a sphere or ellipsoid is unbounded though of finite dimensions. 
Indeed, if we consider the integral 


f/(- £11) dX, 


which is the length of the semi-axis of x, in natural measure, 
it is clearly finite if g,, approaches a zero of sufficiently high 
order as %, increases, and it will be seen that in the solutions of 
(5) Ziv Lee, L33 Satisfy this condition. 

In his book “ Raum, Zeit, Materie,”’ 4th edition, section 34, 
p. 248 (English translation, p. 273), Weyl does not admit the 
cogency of these arguments based by Einstein on considerations 
of the interconnection of the world as a whole and leading to 
this modification of boundary conditions. He, on the other 
hand, contends that the differential equations in themselves 
contain the physical laws of nature in an unabbreviated form, 
and even without boundary conditions all ambiguity is excluded. 
Thus, for example, the laws of the electromagnetic and gravita- 
tional fields satisfy the principle of causality, i.e., the time 
derivatives of the “ phase quantities” A), Fur, guy, {Au,v} are 
expressed in terms of these quantities themselves and their 
differential coefficients with respect to the space co-ordinates, 
so that from the state of affairs at one instant we can predict 
the state of affairs existing at the next. Further, we have no 
occasion to concern ourselves about boundary conditions at 
enormous distances beyond our present powers of observation, 
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for our co-ordination of events is only valid without ambiguity 
in the neighbourhood of some definite world event. The 
argument is a subtle one, and may perhaps be made more clear 
by a consideration of Gaussian co-ordinates on a surface. In 
general, the lines forming the “‘ meshes ”’ of the Gauss method 
intersect in more than one point on the surfare, and so these 
meshes can only be used unambiguously to define the co-ordinates 
of points on a limited position of the surface, and similar limita- 
tions must in general hold for any co-ordination of the events 
of the world. One of the most obvious ways of effecting 
co-ordination in the world by the Gauss method is to take a 
definite point-instant of the world as origin O, and introduce 
co-ordinates %1, %, %3, ¥4, such that at O itself 


ds? = 8x47 — 8x? — bx,? — 8x2. 


There is a three-dimensional space x, = 0 which surrounds 
O. In this we mark off a region R such that in it 


8x1? +- 6%? + 8x5? 


is everywhere definitely positive. From each point of R draw 
the geodesic line of the world which is orthogonal to R, and 
draw it in the time-like direction, i.e., “‘ towards the future.” 
These lines will fill up without intersection, and therefore with- 
out ambiguity, a certain four-dimensional neighbourhood of O. 
Now assign to any event or world-point in this neighbourhood 
co-ordinates such that %,, %2, x, have the same values as were 
assigned to the world-point in R from which started the geo- 
desic passing through the event in question, and x, has the 
value equal to the proper time or separation between the world- 
point in R and the chosen world-point. Taking a definite value 
for x,, this will define for us a region R’ of another three-dimen- 
sional space surrounding the point (0, 0, 0, %,4), and it is not 
difficult to show that the geodesics are orthogonal to this region 
also. Thus along any of the geodesics in question there is no 
change in %, %», %3, and so 


as = a0" 
or 844 = I. 
The geodesics are also orthogonal to x, = 0. Hence 
Bis = Sos = 8s4.—= 0* . . nd 


* This can be seen by an appeal to three-dimensional Euclidean 
geometry. If the axis of z is normal to the plane of xy, but the axes of 
x and y are not necessarily at right angles, the square of an element 
of length might contain a term in drdy, but not terms in d¥éz or dydz. 


334 RELATIVITY 


provided x,= 0. Since, along a geodesic, %,, %2, %3 do not 
vary, it can be seen from the equations of a geodesic 


i, + {oB, w}iake = 0 


that 
(44, w} = 0 
if oe ees Or 
Hence 
(44, w] = 0, 
and ‘since ¢7, = 1, 
28 14/0%4 = Weal dX q = 2 g4/0%q =O. . (8) 


Combining (7) and (8) we find that 


in = Bon == Boi = € 


at the point on a geodesic adjacent to the point in R, i.e., at 
all points in R’. Thus we arrive at a co-ordination which 
involves a family of geodesic lines with time-like direction and 
a one-parameter family of three-dimensional spaces, orthogonal 
to the lines, viz., x, = constant. Using this system of 
co-ordinates, 


{uv, 4} = — $2@y/d%q (u,v = I, 2, or 3), 


and so by means of the gravitational equations we can express 
the derivatives 
d{uv, 4}/d%4 


in terms of the g,,, their first and second order derivatives 
with respect to x4, Xg, x; and the-{uv, 4} themselves. So the 
principle of causality is satisfied, but we cannot say how far 
we may proceed along the geodesics or travel out into the 
family of three-dimensional regions until intersection takes 
place and it becomes impossible to express the co-ordinates of 
all events in this manner in a continuous and singly-reversible 
(“umkehrbar eindeutig ”’) way. 

But although Weyl questions the validity of Einstein’s 
argument from boundary conditions, he does not, in conse- 
quence, oppose the conclusions, but he attributes greater 
weight to the following consideration, which would also suggest 
the same result. 

In our stellar system the relative velocities of the stars are 
remarkably small compared to the speed of light. How does 
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such a system persist ? How is it that it has not in ages past 
dispersed into infinite space? To a Maxwell ‘‘ demon” the 
problem of the persistence of an enclosed gas system would 
present itself in a somewhat similar manner. Yet we demon- 
strate the validity of such persistence by statistical methods 
which also prove that serious departure from uniformity of 
density is of very brief duration and of very limited extent, 
and that velocities differing largely from a certain mean are 
comparatively small in number. But any attempt to apply 
such an argument to the stellar system breaks down if we 
apply it in connection with the law of gravitation as hitherto 
formulated. or an ideal state of equilibrium in which there 
is a uniform distribution of bodies at rest in a static gravita- 
tional field is irreconcilable with such laws. On the Newtonian 
theory such a distribution in infinite space would lead to an 
indefinite and infinite value for the intensity of gravitational 
force at any defined point. On Einstein’s theory the world-line 
of a particle at rest, i.e., a line along which %,, %», x3; are constant, 
is a geodesic if 
{44,u}=0 (4 =I, 2, or 3), 

or 

[44, w] = 0, 
and therefore 

Waa/ dX, = O. 


Thus such a distribution at rest is only possible if 
£44 = constant, 


which is incompatible with the Einstein law of gravitation, as 
can be seen readily by remembering that in the first approxi- 
mation I — g4, is equal to twice the Newtonian potential, and so 


Agas = 4rKp. 


That being so, it is pointed out by Weyl that, if we appeal 
to the Principle of Stationary Action, it is clear we could have 
introduced just as readily for the action of the gravitation field 


the invariant integral 
/q(aG + p)dw 


SqGdw 


where a and f are numerical constants. This is clearly equiva- 
lent to (6) above, and it appears that the ideal state of equili- 
brium considered above is compatible with this modification. 


instead of 
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The demonstration of this we shall also defer for the present, 
and deal with it along with the finiteness of space at a later 
stage, for there is a third line of argument due to Einstein 
distinct from his first line, and from Weyl’s, which leads us 
once more to the same modification of the law of gravitation 
as before. 

At present Physical Science is committed to an electrical 
theory of matter. All matter is constituted of charged cor- 
puscles (electrons, nuclei). The inertia of matter is but a 
measure of electromagnetic energy, either “‘ bound ”’ to these 
corpuscles or “‘ free,’ in the form of radiant energy. One 
essential difference between the ‘‘ bound ” and “ free’ energies 
is the fact that the proper or rest mass of the former is not zero, 
while it is so for the latter. This may be seen in a simple 
manner by the aid of the restricted theory; for the mass of 
anything travelling with the speed of light would be infinite 
unless its rest mass were zero. But this is a somewhat imperfect 
manner of stating the distinction. From the point of view of 
the generalised theory, the scalar T of the matter-tensor T,, 
is not zero, but equal to uw, the proper density of the matter, 
while the scalar E of the electromagnetic energy-momentum 
tensor F,, is zero, as can be proved easily by a reference to 
the expression for it at the end of Chapter X. The origin of 
this distinction is closely connected with a very serious diffi- 
culty in the electrical theory of matter, which still awaits a 
solution. This difficulty is simply the existence of the electron 
itself. If we accept it as a fundamental concept—as the 
chemists of the nineteenth century accepted the atom—the 
difficulty does not arise; but if we wish to treat it as a 
structure and derive it from the equations of the field, we are 
at once perplexed by the fact that it coheres and does not 
dissipate itself under the mutual repulsion of its parts. Some 
years ago Poincaré suggested that the non-Maxwellian cohesive 


forces might be regarded as a pressure whose value could be 
represented by 


270" 


where o would stand for the surface density of charge on the 
electron; this expression being derived from the well-known 
electrostatic theorem that the repulsion on a part of the surface 
charge of a conductor can be represented as a normal tension 
on the surface of amount 270? per unit of area. Such cohesive 
forces would, of course, introduce a new term into the energy 
of the field, whose amount would be equal to the product of 
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the average pressure and the volume of the electron. Con- 
ceiving, for illustrative purposes alone, the form of the electron 
to be spherical, the additional energy would be 


$77a°2710%, 
which is equal to 
877°a9(e/47a*)*/3 
= 07/04 
where e is the electronic charge and a the radius. Now the 
electrostatic energy of the field of a spherical electron at rest 
is, of course, by elementary theory 


e7/2a. 


Hence the energy of field and cohesive forces is 2¢e2/3a, 
which is actually the well-known expression for the rest energy 
(or mass) of a spherical electron derived by treating the magnetic 
energy due to its motion as kinetic energy. 

Of course, the existence of such a pressure as Poincaré 
suggests could not be deduced from the present theory of the 
electromagnetic field, but bearing in mind the nature of the 
Relativity Principle as a test or criterion for physical theories, 
we must be prepared to consider whether any tentative hypo- 
thesis can pass this test. The gravitational equation first 
suggested by Einstein was 


(Op sr 


T,, is the matter tensor and no theory as to the nature of 
matter is involved. This equation is of the kind called 
“macroscopic,” i.e., it presupposes that the granular struc- 
ture of matter has been smoothed out, and the density 
occurring in T,, is an averaged density. If, howevcr, we 
wish to regard matter “ microscopically’ as a collection of 
charged particles, we must replace the right-hand side by 


— 82KE yy, 


for this gives expression to the hypothesis that matter is 
electrical in origin. But the scalar invariant E of E,, is zero, 
and this introduces a contradiction, for the scalar of the left- 
hand side is — G, which cannot be zero within matter. 

This contradiction can be removed by choosing in the 
microscopic equation a tensor involving G,, and g,, whose 
invariant is zero, and it is easy to see that such a condition is 
satisfied by the equation 

Guy — BwG = — 8reE yy. ’ - — (9) 
22 
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This leads to the mixed tensor equation 
Gy” — tgy"G = — 8aE,’. : <2 CTO} 
But as the contracted derivative 


(Gu*)a zs 4(gu*G)a 
is identically zero, it follows from (10) that 


£(8u°G)q = — 8ax(Ey*)a 
or 0G [x= Sal pal*s) : isi 


ce 


Therefore in the ‘“‘ void,” i.e., outside the world filaments 
of the electrons, but including the world tubes of any spatial 
regions containing “ free’’ energy of radiation, the curvature 
G is constant, because since J# is zero in such regions of the 
world, 

dG/dx, = O. 
As G is the same in all parts of the void, whether they 


contain free energy or not, we see that this constant value of G, 
represented by Go, satisfies 


Guy — 28uvG = 0 
or the new equation for gravitation outside matter is 

Guy — AB = 0 
where 

A = G,/4. 
This suggests that the new macroscopic equation within 
matter is 
Guy’ — $8yG' = — ples 

or Guy = — Oral d het) (12) 


where we write G,,’ for the tensor G,, — Agy, and therefore G’ 
for G — 4A. 
By contraction 


8axT = G' 
and by contracted derivation 
— 8r«(Ty*)q 


| 
1 


the contracted derivative of G’,” — 4g,"G’ 


” ” ” ” a ree is Sas (G a 4A) 
”» ” ” ” ae Lu it 
0+ Agta nae 


ia 


FINITE SPACE AND THE ELECTRON 339 


and so the laws of conservation of energy and momentum are 
still satisfied. ; 

In this G is an average curvature for a “ physically small ” 
extension of the world, and we see that this average scalar 
curvature exceeds the scalar curvature in the void by an 
amount depending on the amount of matter present. We can, 
it we please return to the microscopic aspect by considering 
equation (11). If dx,/ds refers to derivation along the world- 
line of an electron, 


40G/dxq . dx,/ds = — 8rnF ag ]8dx,/ds 


= — 8axp.~ FopJ*J8 


where py is the proper charge density of the electron. 
But since F,, is anti-symmetric, the right-hand side is zero. 


Therefore dG/ds = 0, 


or G is constant along the world-line of an electron. 

This gives us the picture of extremely thin world-filaments 
within which there is a constant curvature G,;; between the 
filaments lies a portion of the void where there is another and 
smaller constant curvature Gj. This is the microscopic view. 
Macroscopically, filaments and void make up the world-tubes 
of “‘ matter ’’ throughout which there exists an average curva- 
ture G, smaller than G, but larger than G). The remaining 
parts of the void are “ outside matter,” but in them the curva- 
ture is still G,, and not zero, as in the equation first proposed 
by Einstein. 

From the macroscopic point of view G — Gy determines the 
density of matter ; from the microscopic, G; — Gg determines 
the non-Maxwellian binding forces of the electron or the Poin- 
caré pressure. 

Along all these different avenues of approach the evidence 
brings home to us the necessity for a modification of the law 
ot gravitation to the form suggested above in (5), viz., 


Guy — Buy = — 8aK(T yy — $8 wT) 
leading to the scalar equation 
G — 4 = 8a«T, 
where A is a constant curvature which may, if we please, be 


taken so small that the modification does not interfere with 


the experimental observations. 
Addressing ourselves now to the problem of finding an 
22" 
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expression for the separation element and a distribution of 
matter which will satisfy this equation, we make the following 
simplifying assumptions, viz., neglect all pressures and internal 
forces and suppose all matter to be at rest. 

The covariant matter tensor T,, is given by 


Tw = Sualyp 1 8 ; 
= BuaSvBHoraXp 


where py is the proper density of matter. 
With the assumptions just made all the 7,, are zero, except 
T 44, which is equal to 


84a8 apio% aXp 
= Bask sator a” 
F = §4abo» 
since 
ds? aa £ 410%, 


by reason the statical nature of the distribution. 
Now put 


Mo = Ha + be 


where pq is the average proper density and p, represents local 
deviations from this average which are responsible for purely 
gravitational disturbances. Neglecting these local disturb- 
ances, i.e., supposing uw, = 0, we have from (5) 


Gy, — ASa1 = 4K Ma811 


two similar equations in 22 and 33 fa 
I 

Gay — AS ag = — Brrkpa8 as + 47K Ha 14 
= — 4k pa8 14 

Einstein suggests that on the vast scale which we are con- 
sidering (still neglecting local disturbances of the g,, due to 
local deviations of the matter density from y,) the following 
expression for 5s? holds good : 


ds? = bx,? — 6x,* — R* sin? (%,/R) (x? + sin? x.8x,52). (14) 
where FR is a constant length of enormous value. 

The values of g,, are 
811 = — 1, 822 = — R* sin? (x,/R), &53 = — R? sin? (x,/R) sin? x9, 


° 4a — 1 
and from these it can be calculated that = 


G,, = 221,/K*, Go5 = 22oa/R®, Gag = 2253/3, Gaa-= 0; 
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which agree with (13), provided 
Nae Re Arkin . (15) 


The solution (14) corresponds to an expression for a line 
element do 1m space given by 


do? = $7? + R* sin? (r/R) (86? + sin? 6542) .» (16) 


where we regard xj, %s, ¥3 as polar co-ordinates 7, 0, ¢. 
Expression (16), however, differs from the customary 
expression, viz., 
do? = dr? +. 7?(56? + sin? 654?) : x.) (9) 


in such a way that it implies that our three-dimensional universe 
is the unbounded but finite hypersurface of a four-dimensional 
hypersphere of radius R. Near enough to the origin, sin (7/R) 
can be confounded with 7/R and (16) and (17) agree. As we 
proceed further from the origin, however, actual spheres drawn 
in our space with radius 7 begin to behave like small circles on 
a sphere drawn round a given point as centre, with increasing 
spherical radius. Just as the circumference of the latter is not 
anv but 27R sin (r/R), so the area of the surface of a sphere 
becomes on the hypothesis in (16) 


47R? sin? (r/R). 


This increases to a maximum as 7 increases to 7R/2, and 
thereafter decreases to zero for the limiting distance 7 = aR. 
The volume of our universe is then finite and given by 


7R 
/4nR? sin? (r/R)dr, or 27?R3. 
0 


As the average density of the matter of the world is given 
by (15), we find for the total mass of the universe 


aR/2k. 


Thus the size of the universe depends on its mass. 

If we follow out the consequences of this hypothesis of 
Einstein we arrive at some bizarre conclusions. They can be 
obtained most readily by keeping in one’s mind the analogy of 
a spherical two-dimensional continuum. Taking two points 
A and B, and then considering a third point C gradually receding 
from them, at first 2CAB + LCBA is less than two right 
angles, and so anyone who assumes that space is “ flat ”’ finds 
that ACB is positive, and there is a positive parallax. As 
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the distance of C from AB increases to 7R/2, and beyond that 
to wR, LCAB+ LCBA increases to two right angles, and 
beyond that to three, so that the parallax of C would by inference 
become zero and then negative. Rays of light emitted from a 
point and following the great circle tracks of this space would 
return to a focus at the antipodal point, and diverging once more 
would produce the appearance of suns and stars which would 
be merely ‘‘ ghosts.” The fascinating picture is, however, 
somewhat spoiled by the thought that local gravitational dis- 
turbances would probably deflect the rays, and this aberration 
would at least destroy the perfection of focus ; not to mention 
that absorption by the “ world-matter ”’ would probably reduce 
the energy of the beams to zero ere the journey round the 
universe could be completed. However, if, for example, an 
“anti-sun’”’ did exist, it would appear not at the antipodal 
point of the sun as it is now situated, but as it was situated in 
ages past when the light now reaching us was emitted. Indeed, 
as several journeys round the universe could be assumed on 
the basis of such speculation, there might be several ghosts of 
every star haunting regions formerly occupied by it, or regions 
antipodal to these. One thing which the expression 7R/2« 
makes quite clear is the inadequacy of the amount of matter 
known to us to fit this new hypothesis of Einstein. Thus the 
mass of the sun, when multiplied by the astronomical constant 
x, has the dimensions of a length whose magnitude is 150,000 
cms., or 1°5 kilometres.* — (See Chapter XIII. pp. 260.) leas 
estimated that our stellar system has a mass ot the order 10? 
that of the sun. On the supposition that the spiral nebule 
represent 10® such systems, we find that the R corresponding 
to this mass is about 10! kilometres or 30 parsecs, which is not 
even as large as the average distance of the stars visible to the 
naked eye. In order, therefore, to allow for the very large 
stellar distances known to us, and, further, to permit us to 
regard them as only a part of the greatest possible distance 7R, 
we must postulate a relatively enormous R, and in consequence 
the existence of immense quantities of world matter, perhaps 
diffused through our universe with very small density, but 
concentrated here and there into stellar systems or existing in 
innumerable dark stars. Known matter, for example, is insuffi- 


*In fact, it is quite customary in works on Relativity to refer to 
the sun’s gravitational mass as 1:5 “ kilometres.’’ This, of course, 
arises from the fact that «M/r is a pure ratio, viz., the ratio of the squared 
velocity of a particle describing an orbit of radius 7 to the squared 
velocity of light. 
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cient to produce the centrifugal gravitational effects at our 
earth’s surface by its diurnal rotation in our natural frame of 
reference, and so Einstein’s hypothesis is directly in line with 
the demands of the philosophy of Mach. 

Despite the attiactiveness of this speculation, it suffers 
from one serious drawback, which will be evident if we return 
to the point at which this chapter began, viz., consideration of 
boundary conditions. As we have written the expression for 
és? in (14), the question of boundary conditions does not arise, 
for there is no boundary. But, of course, it is easy to make a 
transformation of co-ordinates which will permit no limits to 
be assigned to one of the co-ordinates at least. For example, 
if we were considering the geometry of the surface of an ordinary 
sphere, we could make a representation of it on a tangent plane 
at the origin by writing 


w = Rtan (r/R), 


which would involve infinite values of @. 
If we make the same transformation in (14), written as 


5s? = 8f2 — 672 — R* sin? (r/R) (50? +- sin? 65¢?), 
we obtain 
ds% = 842 — des?/(r + Aos?)? — o7(50? + sin? 054?)/(1 + Aw?) (18) 


where 
A= TR. 


This corresponds to the “ flat ’”’ world of an observer at the 
origin, who naturally takes w, 0, ¢ as polar co-ordinates. Of 
course, (18) shows that this flat world is not governed by 
Euclidean axioms. For one thing, its scalar curvature is not 
zero, but is, of course, equal to the finite value for the form (14) 
from which the transformation has been effected. We make 
another transformation which gives us Cartesian co-ordinates 
in this observer’s flat world, viz., 


%, = wsin6cos d 
%_ = e%sin Osin d 
<== cos 0 
%, = 1, 

whence we obtain 
8s? = £ag0%,0%p, 
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where the g,, are given by the scheme 


—k-+Ak*x,? AR®% 4% 5 Ne ko 
AR2x% 4%, — Rk + Ak*x,? AR tO (19) 
ARAi x 1 AR2x%,%,  —- k + ARPx,5? O : 
fo) fe) O I 


(We write & for (x + Aw?) —}.) 


If @% = 0, (19) degenerates into the scheme (1). 
As w increases indefinitely, scheme (19) gradually approaches 


O O O O 
O 
O O oO (20) 
O O oO oO 
oO O O it 


Now this is not the scheme (4), which is the only boundary 
scheme which is invariant for all transformations. Scheme (20) 
is, however, invariant for transformations to accented co-ordi- 
nates in which the transformation equation 


4! = f(%1, Xa, Xs, Xa) 
degenerates into 
Xq = %q 


atinfinity. Nowsuch a limitation is a violation of the restricted 
principle of Relativity. As Eddington says: “We have a 
feeling that Einstein’s new hypothesis throws away the sub- 
stance for the shadow ;’”’* he “‘ has restored the differentiation 
between space and time by assuming the space-time world to 
be cylindrical, so that the linear direction gives an absolute 
time.”’ Or, in the words of de Sitter: “It thus appears that 
the system only satisfies the mathematical postulate of Rela- 
tivity if the latter is applied to three-dimensional space alone. 
In other words, if we conceive the three-dimensional space 
(%1, %2, Xs) with its world matter as movable in an absolute 
space, its movements can never be detected by observations : 
all motions of material bodies are relative to the space (x4, %2, 3) 
with the world matter, not to the absolute space. The world 
matter thus takes the place of the absolute space in Newton’s 
theory, or of the inertial system. It is nothing else but the 
inertial system materialised. It should be pointed out that 
this Relativity of inertia is only realised by making the time 
practically absolute. It is true that the fundamental equations 
of the theory, the field equations, and the equations of motion 

. remain invariant for all transformations. But only such 


* Report, p. 87. 
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transformations for which at infinity ¢’ = ¢ can be carried out 
without altering the values (1A),”’ [our (20)].* 

This drawback to the hypothesis of a finite universe is 
avoided, however, if we accept a suggestion made by de Sitter 
that on the vast scale considered the element of separation is 
given by 


ds*= cos? (x1/R)dx.?—8x .?—R? sin? (x,/R) (5x_2+-sin? x 5x52). (21) 


On working out the G,, for these values we find that 
without exception 
: Cur = 38ur/R?, 
which satisfies (13) 
if A = 3/R? 
and Lo. 


The difference between this hypothesis and that of Einstein 
is most marked. In Einstein’s speculation the space-time world 
has a natural curvature of 4A or 4/R?in the void, which, however, 
is due to the world matter. This curvature of course is greater 
than this within matter (G — 4A = 87«T); it becomes 6A or 
6/R?, for instance, within matter of average density p,. 

On de Sitter’s assumption there is no world matter (u, = 0) ; 
there are just local “islands ” in a vast void. These will, of 
course, modify locally the natural curvature 3/R? of the space- 
time world, which is itself due entirely to the modified law of 
gravitation. 

If we write (21) in polar co-ordinates 7, 6, ¢, then transform 
as before to a flat world tangent at the origin by the transfor- 


mation 
ws = FR tan (r/R) 


and then introduce Cartesian co-ordinates, we obtain a scheme 
of gu, which agrees with (19), except that the bottom right-hand 
constituent becomes (1+ Arv?)—! instead of unity. So at 
infinity we get the limiting scheme (4), which is invariant for 
all transformations. De Sitter’s hypothesis, therefore, involves 
no breach of the principle of Relativity. 

There is one interesting feature of de Sitter’s suggestion 
which holds out some hope of ultimately receiving confirmation. 
The multiplication of 64? in (21) by cos? (7/R) implies that as 7 
increases, the observed time of natural processes increases. 
The assumption is the same as that raised in the discussion of 
spectral line displacement in a gravitational field. If some 


tuUMNERAS.” LXVIIL, p.-3;.1917. 
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atomic process, for instance, has a universally constant value 
for its proper time, Ss, then obviously as 7 increases and 
cos (r/R) diminishes, d¢ will increase. Indeed, at a distance 
aR/2 from the place of observation, 5¢ would be infinite, and 
all such processes would stop relative to these observers. Of 
course, to observers situate at this place itself, phenomena in 
the neighbourhood would present their normal appearance, 
while stagnation would prevail for them at our former origin. 
At the zone 7R/2 away from us light is also reduced to rest ; it 
cannot get round the universe as in Einstein’s universe, so 
there will be no anti-suns; no information can come to us 
from regions beyond for the same reason. Stars at this zone 
will have the same parallax as a star at distance FR in Euclidean 
space (this can be seen by consideration of a two-dimensional 
spherical space), and as stars beyond this barrier will be invisible 
to us, this parallax will be the least possible. 

On account of the slowing down of time at great distances, 
spectral lines from a source at great distance ought to be 
displaced somewhat towards the red end as compared with 
terrestrial sources. Now there is some evidence that there is 
such an effect in the case of the spiral nebule. Of course, 
velocities of approach or recession of these systems would 
produce spectral displacement towards violet or red, as the 
case may be; but if, as seems probable on grounds which have 
been investigated closely by de Sitter in the paper quoted 
earlier, velocities of recession should not occur more frequently 
than velocities of approach, then it would appear that the 
existence of abnormal Déppler effects (preponderating some- 
what towards the red) in the case of objects at enormous dis- 
tances as compared with our nearer stellar neighbours, yields 
some little confirmation to de Sitter’s world. But the observa- 
tions are still too few to afford certitude to any statement. 

By transformation to other co-ordinates the complete 
pseudo-sphericity of de Sitter’s world can be shown. Thus if 
we write 

sin (7/R) = sin €sin w 
tan (u/R) = cos tan w 
we obtain 


8s? = — R%{Sc? + sin? w[SZ? + sin? ¢(362 + sin? 05¢2)]}, 


which suggests polar co-ordinates, R, w, £, 0, d, in a five-dimen- 
sional space. But as the angles ¢, w are imaginary, the spheri- 
city is rather spurious, resembling the pseudo-Euclideanism of 
the space-time world of Minkowski in the restricted theory 
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after the introduction of imaginary time. If we introduce 
real angles and write’ 
C= 0, w= w’, 
we see the hyperbolical form of our world, since in that case 
ds? = R?{dw’* — sinh?w’[Sf’? + sinh?t’(362 + sin? 0842)]}. 
In fact, if we write 
=, = ht Cosh ay 
X, = Rsinh w’ cosh ¢’ 
x; = Rsinh o’ sinh & cos 0 
%, = Rsinho’ sinh ¢’ sin 6 cos ¢ 
x, = Rsinh o’ sinh ¢’ sin 6 sind 
we have 


ae 2 : ? a 
%4 HoX + X52 + Hy? + X52 = R?, 


which is the equation of a four-dimensional hyperboloid in a 
space of five dimensions. Also, 


of) = - 2 2 2 2 2 
65" = 6%," — Ox ,* — 6%,7 — 6%," — 6x,?. 


For de Sitter, space-time has an existence and a form 
independent of the matter it contains (which only intervenes 
locally to modify the general curvature) ; space-time would 
still exist even if matter were destroyed. Such a conclusion 
will, of course, be resisted violently by the materialist philosophy 
of Mach and his followers. 

Taking the equations of motion as formulated in Chapter IX. 
we can obtain the motion of a particle either in Einstein’s 
universe or in de Sitter’s, and deduce some interesting conclu- 
sions. They are worked out very fully by de Sitter in the paper 
mentioned above, and the reader is referred to it for further 
information. 

It should be pointed out for the benefit of those unfamiliar 
with multi-dimensional geometry, that the spherical three- 
dimensional space of Riemann is not the only kind of finite space 
which can be conceived to be a physical reality. In Riemann’s 
space all straight lines starting from a point intersect in another 
point whose distance from the first measured along any of these 
linesis wR. Many years ago, however, the astronomer Newcomb 
pointed out that it was possible to,lay down a set of axioms 
and postulates which lead without any contradiction to the 
conception of a space in which a straight line returns to itself 
after going round the universe, but does not intersect any line 
more than once. Newcomb’s investigations, which are very 
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interesting, can be found very clearly and concisely expressed 
in quite a brief paper to Crelle’s ‘‘ Journal,” 83, pp. 293-299 
(1877). In the opinion of Einstein and de Sitter, Newcomb’s 
“ elliptical’ space is really simpler than Riemann’s spherical 
for representation of the finite physical world. In the “ ellip- 
tical’ space there are infinitely many points at the greatest 
possible distance from a given point, and they lie on a straight 
line, which is 47R from the given point. This is in contrast 
to spherical space, where there is only one point, the “ anti- 
podal,”’ which is at the greatest distance, that distance being 
mR. Right round Newcomb’s space is wR, but 27R round 
Riemann’s. The volume of the latter is 27?R?, of the former 
m?R°, Of course, in elliptical space, even with Einstein’s hypo- 
thesis of “ cylindrical time,” the existence of anti-suns, etc., 
would no longer follow. 

In conclusion, de Sitter has endeavoured to make a rough 
estimate of the value of R from a variety of astronomical data, 
and finds values of such an order of magnitude as Io0!2 to 10!8 
radii of the earth’s orbit, i.e., about 107 parsecs, or 1028 cms. 
This would make the curvature A about 10 ~ 5° (cm.)~ 2. 


CHAPTER XVI. 


GEOMETRY AND PHYSICAL FORCES. 


It has been pointed out by more than one writer on Relativity 
how progress in physical theory during the past quarter-century 
has been effected by the removal of cramping restrictions on 
our ideas. The partial abandonment of the concept of a rigid 
body implied in the Lorentz-Fitzgerald contraction gave an 
impetus to research which led directly to the earliest formula- 
tion of Relativity. The total abandonment of that idea, 
combined with a freer attitude towards simultaneity, gave us 
the Minkowski world. The recognition that a more general 
metric than that used in Euclid’s geometry was needed for this 
world gave us gravitation. It now remains to indicate how a 
further loosening of bonds can give us electro-magnetic force, 
and may yet succeed in revealing to us the cohesive forces on 
the electron which still lie outside Maxwell’s electromagnetic 
theory. Doubtless it is true that the whole theory of the 
electromagnetic field is perfectly consistent with the hypothesis 
of general Relativity, but there is not apparently that close 
connection between the electromagnetic potentials and field 
tensors and the quantities g,, which define the metric of the 
world as exists, according to Einstein, between these same 
quantities and the gravitational field. It was Weyl * who 
first pointed out how a still more complete metrical treatment 
of the world than that used in Riemannian geometry affords 
a means of identifying the electromagnetic potentials with 
certain quantities introduced in this wider view of geometry. 
Eddington + has shown that even Weyl’s geometry suffers from 
a restriction; he has worked out certain conclusions which 
can be drawn from its abandonment, and suggests that we may 
thereby find some unsuspected relation between this extremely 
wide geometry and the non-Maxwellian binding forces on the 
electron, although he has not succeeded in doing so at present, 


* « Sitz, Akad. Preuss.,’’ May, 1918, and his book, “‘ Raum, Zeit, 
Materie.”’ 
t “Proc. R.S.” 99, A. 697, May, 1921. ‘‘ Phil. Mag.,” 42, Nov. 
1921; 43, Jan. 1922, 
349 
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and, indeed, makes no assertion that his generalisation will 
provide the materials for the solution of that difficult problem. 

It is a commonplace in the literature of Relativity to point 
out the nature of our experimental observations. They are 
almost entirely judgments of coincidences between fiducial 
marks on instruments such as rulers, protractors, micrometer 
screws, galvanometer scales, spider lines in eye-pieces, and 
observed marks on bodies, photographic plates, or optical 
images. We note that a particular event in one chain of 
observation coincides in space and time with a particular 
event in another chain, or that the world-lines of two distinct 
objects intersect in a certain common world-point. In order 
to discover relations between such observations we introduce 
co-ordinates or number-sets, which on our present views may 
be perfectly arbitrary, provided they are unambiguous and 
definitely distinguish one event from another, and then set up 
an expression for a measured interval or separation between 
two events. This almost amounts to an admission that Bergson 
is right in asserting that intellectually we do not appreciate 
the flow of life, but merely visualise static sections or kinemato- 
graphic pictures of life and reason about these. 

One essential feature of this method of treatment is the 
introduction of the idea of congruency which grew up, of course, 
with the idea of a rigid body, permanent in configuration. If 
two marks on one body coincide with two on another, they will 
always, and at all places, do so provided we take certain 
well-defined precautions. That is an assertion so eminently 
reasonable and “‘ commonsensical ’’ that denial of it appears to 
shake the whole fabric of thought. The bewildered ‘“‘ man in 
the street ’’ asks in helpless amazement, ““ What next!” Yet 
it is just that statement which Weyl proposes to remove from 
the category of completely valid assertions, and, as he can 
easily establish, he defies no definite experimental evidence in 
so doing. After all, our ideas of a rigid body have already 
received some rude shocks. In truth, the effects of temperature 
and pressure on our most “ solid’ of materials are such that all 
we could assert of old was that with very definite care we have 
experience of bodies which as accurately as we can observe, 
do comply with the idea of congruency, and preserve an 
unaltered configuration with lapse of time or after movement 
in space. But the Relativity implied in that statement is 
apparent to anyone who reflects that provided all bodies 
change their configuration similarly after experiencing the 
same displacement, our notions concerning congruency would 
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require no modification as far as the phenomena were con- 
cerned. When Lorentz suggested that movement might 
contract the “rigid”? body, the conditions for ‘ congruent 
figures ’ had to be closely defined indeed, and nowadays the 
concept of the “‘ rigid body ”’ has ceased to play any vital part 
in physical science. 

Still, we do maintain that we can measure bodies, and, of 
course, our material standards can serve as measurers of time 
also, by adopting the velocity of light “in natural measure ” 
as our unit of velocity. In laying down a co-ordinate system, 
we introduce no statements about measurement. Expressions 
for intervals of length, time, or separation can afterwards be 
introduced in terms of these co-ordinates, but the usefulness 
or validity of these can only be tested by the employment of 
material standards or “‘ gauges,’”’ which are themselves part of 
the observed world. Of course, the procedure is an easy one 
apparently ; we set up a standard gauge at one place, and then 
transport it, or accurately made copies of it, to all other places 
where required. But supposing that the journey produces 
some definite effect on it; not only so, but suppose that the 
effect is not the same for different ways of making the journey. 
Thus suppose two copies of the same gauge which fit accurately 
at A are transported to B, one by a path APB, the other by a 
different path AQB. What would be the result on the mathe- 
matical treatment of geometry if it were admitted that the 
gauges do not fit true at B? To be sure, our measures have 
not so far given any definite support to an assertion that they 
would not do so; but all measurement is in the last resort 
affected by some error, instrumental or personal, and we cannot 
certainly assert that they would fit to the most ultimate degree 
of accuracy. The point at present is not whether they do or 
do not fit, but what would be the result of admitting that pos- 
sibly they do not. It must be grasped clearly that in speaking 
most generally of a different path, we consider time as well as 
space. Thus both gauges might actually be transported along 
the same path in the frame of reference, but one might pass 
through the intermediate positions at quite different times to 
the other; that would constitute difference of path from our 
enlarged point of view. 

Such an admission would appear to leave our measuring 
system extremely arbitrary. We could, of course, set up a 
standard gauge at A and then define the routes along which the 
copies should be transported to other places where measure- 
ments are to be made. But naturally the question would 
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arise—on what grounds are we to select these privileged routes ? 
And from that it is only a step to the query—why should there 
be any privileged routes at all? We have refused to endow 
any system of co-ordinates with special rights ; why do so to 
any system of gauges ? Is it not possible to express geometrical 
facts in a manner which renders these expressions as indepen- 
dent of a gauge system as they are of a co-ordinate system. 
Weyl has shown that it is possible, and has pointed out that 
the new geometrical quantities introduced for the purpose of 
such expression appear to be related to the potential functions 
of the electromagnetic field in a manner similar to that which 
connects the metric quantities gy» with the gravitational 
potentials of Einstein. 

The connection between Riemann’s geometry and the new 
one suggested by Weyl can be made clear ina simple way. In 
Riemann’s geometry the transport of a vector round a closed 
path in a multi-dimensional continuum by successive elementary 
parallel displacements does not in general preserve the original 
direction of the vector. Weyl’s suggestion is that neither does 
it preserve its gauged magnitude, for the admission that two 
gauges which fit at A may not fit at B, after one has travelled 
along APB and the other along AQB, amounts to a statement 
that a gauge which fits a standard gauge at A may not fit it 
after travelling round a closed path APBOA in which the 
return journey from B is not the outward journey merely 
reversed. In short, there is a non-integrability of magnitude 
as well as of direction. 

But in order to make any headway at all we must postulate 
that such a thing as a “‘ congruent displacement ”’ exists, just 
as we postulated that such a thing as a “ parallel displacement ”’ 
exists. To be sure, the vector A* at the point x) is not in 
general parallel to the vector A* at x, + 6x); so now by this 
further enlargement of our ideas the squared distance, which 
is gauged as /* at x,, is not congruent to the distance, whose 
square is gauged as /? at x, + dx).* But there is a one-one 
correspondence between distances at x, and distances at 
%,-+ 6x, The vector at x, + 6x), parallel to A’ at x, was 
defined by | 

5A = — fgrOXp, 
where the f,* were assumed to be linear functions of the vector 
components, viz., 
Re = fapA a 


* As the sequel will show, it is a little more convenient to speak in 
terms of the square of a distance rather than the distance itself, 
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So now the distance at x, + 8x) congruent to a distance 
whose square is gauged as /? at x), is gauged as J? +- 8/2 where 
8/7 is defined by 

Of == 1° O20%as, : ; ees 


fi, 2, $3, $4 being four scalar functions of position. This 
assumes that the alteration in gauging is linearly related to the 
length itself, which seems a necessary condition if displacements 
are to preserve similarity between a set of distances at x, and 
the congruent distances at x, + 8x). 

But, as Eddington has pointed out, there is still a wider 
assumption possible, and that in postulating that the multiplier 
of /*, which gives the change in the measure, is proportional to 
/?, Weyl has still left a restriction on the geometrical treatment 
of the space-time continuum which it may be necessary to 
remove. We shall return to this point later, however, and 
proceed to develop the consequences of (1). 

As $,5%, must be independent of a change of co-ordinates, 
it follows that ¢) is a covariant vector. 

It is clear that the formule of Chapter XIV. will require 
modification in view of Weyl’s assumption, notably the expres- 
sions for the quantities fy,” and the curvature tensors. Thus it 
will be found that /,,” contains terms in addition to {Ay, v}, and 
that the Riemann-Christoffel tensor is only a part of the 
expression required in Weyl’s geometry. We shall employ the 
symbols fx,” and fay, to denote these general values, but in 
accordance with the practice in recent literature, we shall 
indicate a tensor generalised from the corresponding tensor of 
the Riemann theory by prefixing an asterisk. 

On referring to Chapter XIV., it will be observed that in 
the work leading to equation (12) of that chapter we can no 
longer put 

(d8ap/ Xe — feea — facg) A%A 8%. 


equal to zero. We must equalise it with the right-hand side of 
the equation (1) just written, where /*, being the invariant 
squared magnitude of the vector A, is given by 


BoggAtAs. . . « @) 
Hence 
(d8a8/ 0% — fpea — fucp)A%A 98x = apA*Afh dX. 


In consequence of this 


Waul %y — fur — fru = LauPy; . ° (3) 
23 
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whence it follows by steps similar to those in Chapter XIV. that 
Pruv = [Ap ¥] + d(Barbu + Sub, — Bauhy) - (4) 
frp? = De. v} + E(Qy’bu + Buh — Bau’). - (5) 


On developing the curvature tensor by the method of 
Chapter XIV., we find that after a parallel displacement round 
a small circuit the change in the component A*, Le. AA*, is 
still given by (15) of that chapter, where the tensor is still 
formally defined as in (16), but with the extended expression 
for fxy” written above. 

Thus 


and 


AA® = 4*RapkASSPy, . . . (6) 
where 
* Rye = Orv | 0% pu tek Say? /dXy + frs“fau™ xu taX 
SN i 7 6) 
ae {Av a} Ce 6 {ape, Kk} Uy,% + Opn One 
= {Au, a} U ak aii {av, K} Oe fans OU as 


where we write U),, for the tensor (symmetric in A and p) 
2(Savbu + Suh, — Sruhy) 
and Uru” = (8x bu + Bw, — Erubr). . —«  (8) 


Employing the notation (A,), still to represent covariant 
derivation, as defined in Chapter IX., i.e., as a device for deriving 
tensors by differentiation without any consideration of gauges, 


* Rawk ras Rw os (On*) = (Urut)y  U. rv Vay" 
— UyyutU aw". (9) 


Putting «=f, multiplying by g,g, and contracting we 
obtain 


Sie oc == Fare == (Can a (eae +" oe U ' + 
pa Ape / avKe Io 


The tensor Ray». is anti-symmetric as regards the indices yu 
and v, and inspection will show that the remaining terms on 
the right-hand side of (10) are also anti-symmetric with respect 
to w#andv. Hence this is also true of the tensor * Ries Sale 
another matter, however, when we consider « and A. Certainly 


} The reader must observe that the lowering of the index « inside 
the bracket which indicates covariant differentiation, is not purely 
mechanical as in the other terms, but requires a few steps which, 
however, are not difficult and involve the equation, Expiry, B}=[rp, B]- 
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Ryuve is anti-symmetric with regard to those indices, but the re- 
maining terms are not. However, they can be separated into 
two tensors, one symmetric with respect to « and A, and one 
anti-symmetric, for any tensor such as Ax, can be written as 


b(Axa etx) + B(Auar Axx); 


and the first bracket defines a symmetric tensor, the second 
an anti-symmetric. In this way it is clear that we can write 


nl ee == dey 4 5 be sa ale Gin: 


=> Dy + Fagen: 


where Aywx and Ky. are the two parts of the expression 
formed by the last four terms on the right-hand side of (ro), 
anti-symmetric and symmetric in « and A respectively; and 
Iywx is the tensor Rawx + Hawex, and is anti-symmetric with 
regard to « and A as well as w and v. The value of Ky in 
terms of the g,, and ¢, can be worked out by a series of tedious 
steps ; but it can be arrived at very readily by the following 
artifice. 

Consider the change in /* the squared magnitude of the 
vector A’ after parallel displacement round a small path. On 
the one hand, we have by (t) 


AP = Pfh.d%, ) eee 
= P2DygdSe J ( ) 
by Stokes’ Theorem, where 
Day = Ipy/d%) — 06)/2% us 


and 8S is the anti-symmetric tensor corresponding to the 
element of area enclosed by the path. 
On the other hand, 


A 
A(A Lae 
A,AA* + A*AA, 
2A,AA*, 


because it is the difference of two 
), and in consequence AA) = g),_ AA*. 


‘ ll ll 


since AA® is a vector ( 
vectors at the same point), 
Thus by (6) 
AP == AR gy AA 057 

= even At Asoo rs 

== Tepe As" 


‘by a change of dummy suffixes. 
23 
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But this is equal to 
Tapeh t Ass”, 


for the anti-symmetry of the tensor Ja. in y and « entails its 
disappearance in the summation. 


Hence 
KyapeAYA® = Dapl? 
— Di peyeAVAs, 
i.e. Kun = Lr Puy: ° Q C (13) 


Returning to equation (9), we put « = v = f and contract. 
The result is 


A Oe <== Gru et (Ox,9) u Fr (Ux) sie Ux re Uy, fU ca (14) 
The only term on the right-hand side which is not sym- 
metric in A and p is the second term, and we can easily show 


that 
U8 = 2h, 


so that the non-symmetric term in (14) is 2(¢a) x. 
Thus 


*Grp = Gray + Vau + 2(pr)u 


where Vy, is a symmetric tensor. 
This can be written 


ol OH = Gru = Vw a (Pa) u aS (Pu)a a (Pr) = (Pua 
| os 


Xe oe Dye Kxn 


==] Au = K Aue 
where /,, is a symmetric tensor and Ky, is the anti-symmetric 
tensor 
(pr)u — (Buda 
= oda/ tg — dpy/d% 
= — Ame 


This result agrees with (13), for since 


OMe = LurPuy, 
we have 


Ky pt — ok, D y 
and therefore i Pate 
Kip = g%rPua 

= dy 


Am 
A short digression may be made here to point out that certain 
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tensors possess the property of being unchanged by any altera- 
tion in the gauge system. Thus suppose that the gauges are 
changed in such a manner that an interval which was formerly 
gauged as / at a world-point is now gauged as lJ at the same 
point, y being some function of position. In consequence of 
this the gauge-vector ¢) will be altered to 4)’ where 


5 (Yl?) = pl?b,'dx4. 


So that 
pa dX%q = 8( pl?) /pl? 
= 6/7/l? + 8y/% 
= $05%_ + 8 log ¥. 
Hence 


6) =), + dlog #/dx,.  . . (16) 
In consequence 


Day’ = d6,"/d4%) — 044" /d%y 
= pul dX, — )/d%y 


== Aue 


@,, is thus such a tensor as we have just referred to. It 
may be called an “‘absolute”’ tensor, but the phrase “ in- 
tensor’ has been introduced by Eddington to distinguish it 
from tensors which do not possess this property. 

A little thought will show that *R,,,* is also an in-tensor. 
In the equation (6), A* is an “‘in-vector ”’ ; for it is a displace- 
ment, i.e., a difference of co-ordinates, and in no way depends 
on a gauge. Likewise, 6S\* depends only on co-ordinate 
differences. So an alteration of gauge will not affect *Ry,,". 
But this is not so for *Rayx. To see this, we notice first of 
all that g,, is not an in-tensor, for if the change of gauge alters 
a measured interval from /? to wi?, ie., from gagd%,5x%g to 
beag0x,0%p, the altered g,,’ will be ¥g,,. Now 


, 
*R hue — 6d) 0G iia 


(remembering that the accent refers to a change of gauges and 
not of co-ordinates). 
Hence 
RR uve = 2 ca au" 
= PLca* Rapy™ 
= WL Ruri: 


It follows that g,, and *Ry,,)» are‘not in-tensors ; Wey] calls 
them tensors of ‘“‘ weight’’ unity. In general, if the tensor 
after the change of gauge-system has a value equal to product 
of its former value by ¥”, its weight is m. Thus in-tensors 
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have the weight zero. As the determinant g is altered to 
yg by the change in gauge, and each first minor is also multi- 
plied by ¥?, it appears that g“” has the weight minus unity. 
In consequence, lowering an index of a mixed or contravariant 
tensor by association produces a tensor whose weight is larger 
by unity ; while raising an index of a mixed or covariant tensor 
reduces the weight by unity. For example, ®* is a contra- 
variant anti-symmetric tensor of weight minus two. Hence 
®,g68 is an invariant (for change of co-ordinates) of weight 
minus two. On the other hand, 


/qPagP8dw 


is an ‘‘in-invariant,”’ since dw depends only on co-ordinate 
differences and the quantity qg or ./(— g) has a weight two. 
We are now in a position to understand the suggestion which 
has been made by Weyl. He points out that the abandonment 
of the geometrical postulate that a parallel displacement round 
a closed path preserves a direction unchanged involves the 
introduction of the metric coefficients g,,, and gives us gravita- 
tion by the identification of the potentials of the gravitational 
field with these coefficients. He proposes the theory that the 
abandonment of the geometrical postulate that congruent dis- 
placement round a closed path preserves a length unchanged 
will give us electromagnetism by a similar identification of the 
potentials of the electromagnetic field with the four gauge- 
coefficients. The suggestion is certainly a fascinating one. 
We have seen how fourteen quantities are required by the physi- 
cist to define the state of the world at any point-instant, viz., 
the ten gravitational potentials and the four electromagnetic 
potentials. Ten of these have been identified with ten metric 
coefficients, and experiment has supported the identification ; 
the removal of an entirely unnecessary restriction on our geo- 
metry provides us with four more metric coefficients whose 
existence had not been thought of, just the number of physical 
quantities hitherto outside a complete identification. It is not 
unnatural to regard this as more than an accident and proceed 
to the identification suggested by Weyl. Unfortunately, no 
direct test of the theory appears to be feasible. It is entirely 
out of the question to attempt to measure the discrepancy in 
the length of a body after it has been carried round any ordinary 
circuit even a considerable number of times in any obtainable 
electromagnetic field. For, of course, the identification of the 
electromagnetic potentials with the 4)-coefficients identifies 
the field tensor with ®,,, and thus the discrepancy predicted 


‘ 


A 
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would vanish where there is no electromagnetic field, and 
increase in amount the stronger the field in which the circuit 
is situated. (See equation 12.) Weyl’s hypothesis, however, 
is attractive to the relativist for a cogent reason—it does away 
with the privileged position which gravitation has hitherto 
occupied in the general theory of Relativity. It is quite con- 
ceivable that the laws of gravitation might have been expressed 
in tensor equations quite different from those given by Einstein ; 
such laws would still have been compatible with General 
Relativity. It is even conceivable that the equations might 
have actually had the same form as Einstein’s, except that the 
ten gravitational potentials might not have been the components 
of the fundamental tensor which determines the element of 
separation in space-time. Relativity would have had nothing 
to say for or against such equations; it would have been a 
matter for the experimentalist and astronomer. In such a 
case gravitational theory and electromagnetic theory would 
have been on an equal footing ; they would both have passed 
the Relativity test, but to all appearances there would have 
been no obvious connection between the physical tensors 
involved in the formulation of these theories and any geometrical 
tensors involved in the development of the geometry of the 
world. But, as stated, the close connection between the gravi- 
tational potentials and the ten metric coefficients g,,, postulated 
by Einstein, gave a position of privilege to gravitation not 
accorded to other physical forces. As the latter forces are 
nowadays reduced to terms of electromagnetic action, it is all 
to the good, as far as the opinion of those who desire elegance 
and comprehensiveness in physical theory has weight, if we 
can without any contradiction of observed fact place electro- 
magnetism on the same plane as gravitation; and this Weyl 
has done by identifying the electromagnetic potentials with 
the ¢, coefficients. 

Eddington, however, has pointed out that even Weyl’s 
geometry retains a restrictive feature whose removal abolishes 
the necessity of connecting electromagnetism with a theoretical 
change in the length of a transported measuring rod. Going 
back to the work which leads to equation (3) above, it is clear 
that in proving that 

(2gap/ 2% — fpea — facp)A*AFdx%e 
is an invariant (for it the difference of two invariants at the 
same world-point), we have proved that 


Wap/ Xe Es I bea aa ee 
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is the component of a covariant tensor of the third order, 
since A*A 88x, is a component of a contravariant tensor of the 
third order. But, of course, it does not prove that this co- 
variant tensor is of any special type, such as the outer product 
of three covariant vectors, or the outer product of a covariant 
vector and a covariant tensor of the second order, as Weyl 
assumes since he equates the component to gag¢e. Eddington 
simply starts from the fact that it is a covariant tensor of the 
third order, and puts no limitations on its form; he abandons 
the definition contained in (1), and proceeds as follows. Since 


Way! My — Suva — fru 

day/ Xp — Fur — fry 

Wur/ 0% ia Sonu =a Surv 
are each tensors of the third order, the sum of the second two 
minus the first is also a tensor. Hence we find that 


i Apy ~~ [Ap, v] 
is a covariant tensor of the third order symmetric in and wp. 
Denote it by Uj,,, where, of course, it is no longer limited in 
form as it was above, when it was expressed in terms of gay 
and ¢,; indeed, no quantities corresponding to ¢, have been 
so far introduced. 
We have therefore 


hive — [Ap, v] he Oxia 
aul = {Ae v} ++ Udy’. 


From this we proceed to the introduction and definition of 
the in-tensor *Ra,,*, and the tensor of weight unity * Roe 
as in (9) and (Io), except that U),, is not now restricted to be 
a special tensor of the type defined in (8), an equation which 
now drops out of the analysis. 

As before, *R)y»« is anti-symmetric with respect to w and », 
and can be split into two parts—one J Awe, anti-symmetric with 
regard to « and A, and the other Ky, symmetric with 


respect to « and A, and it is not difficult to prove that 
Bayon — 4[(frrn oP rales = (rane + Sunr)e) . (17) 


When we come to the contraction of *Ra,,", we obtain (14) 
as before. Of course, 


and 


Ux,8 
is a covariant vector, and we denote it by the symbol 
== 2B). 
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So that (14) can now be written 
il OFF — Gru eae 2(Br)u tas (Uap) ao UDyaPU yp ++ LUG D ey. (18) 


For the further contraction to *G, it will be found necessary 
to introduce a symbol to represent the covariant vector U*,), 
which is quite distinct from the vector U),*, for 

U aw! = ge Mae 
while OAyy = 8 ne Oauy. 

Let us denote U*,, by Ca; and, in consequence, U*,’ by 
CA. Proceeding from (18) we obtain 


"GN, = Oy — 2(Ba)u— 8(U ay") + UrePU ne 
aie 2U%, "Ba (19) 

= G, — 2(B), — (U%W2)q + Ur PU ya* + 2U4,2B, 
(In obtaining this line we have to remember that the covariant 


derivative of g* is zero.) 
Hence, putting A = pw and contracting, 


*G = G — 2(B%)_— (C)a + 2ByC*+ UYU ype. (20) 
As before, we can write 
*Gap = Gru =f Try ae IGM 
Haye Ve Seen 


where H, is symmetric in A and yp, and, therefore, J), as well, 
while K,, is anti-symmetric. 

As — 2(B,), is the only non-symmetric term on the right- 
hand side of (18), we get 


Any = — (Ba)u — (Bua — (Unu*)a + UraUup* + 2Uay*Ba (22) 
and 


Kyu = (Bu)a — (Ba) 
Ae Seana (23) 
Since By = — 4 U),*%, it is easy to prove that 
Kn = £(dfrat/d%Xp = Ofua"/d%a). A : (23’) 


(In Weyl’s theory 
Unaruy = (Barbu + &uvPr — Bru) 
and it is easy to prove that with this restriction 
By = — $n 


Cr, = — da.) 


and 


362 RELATIVITY 


The most arresting feature, however, of Eddington’s investi- 
gation is not the analysis just outlined, but the manner in 
which he proposes to use it for the representation of physical 
facts. The older physics laid great stress on the develop- 
ment of mechanisms which would explain all processes of 
nature in terms of the behaviour of matter in bulk. That 
feature, while not exactly disappearing, is certainly one of 
diminishing importance, except for pedagogic purposes. Ein- 
stein’s theory, for example, has nothing to say concerning 
mechanisms explaining gravitation. What it does is to 
“ explain’ gravitation in terms of two principles, the principle 
of Relativity and the principle of Equivalence, through a power- 
ful mathematical calculus. Nothing more is actually needed to 
arrive at the necessary differential equations. Yet there is 
something of the arbitrary in Einstein’s choice of the law of 
gravitation among other possible and apparently reasonable 
tensor equalities which might be suggested. It is justified 
because it is probably the simplest possible, and it “ works.” 
Now we notice that, whether we know this calculus or not, 
resort to geometrical illustration is very helpful, and we can 
find geometrical statements which run parallel with the develop- 
ment of the tensor calculus itself. At once arises the question, 
is there a closer connection between these geometrical illus- 
trations and the processes of nature than was thought of when 
we entered on our task of extending the principle of Relativity 
to cover all the great generalisations of Physics? We began with 
the well-known and defined concepts of physical science, and 
brought them one by one within the grip of our tensor calculus, 
introducing tensors such as the matter and electromagnetic 
tensors, wherever required, defined in terms of these concepts. 
Einstein and his followers discovered equalities between these 
tensors and other tensors such as the G,,, which are purely 
geometrical and involve only the quantities required to define 
an interval. These equalities “work.” But what shall we say 
to the suggestion that when we survey the completed scheme 
we shall be compelled to admit that for the most comprehensive 
grasp of the “ nature of things’ we must reverse the order of 
procedure? We shall begin, it is suggested, with a four-dimen- 
sional continuum, and lay down a general quadratic expression 
for the squared distance between two contiguous points. 
Thereafter, the mathematician can develop a whole series of 
propositions by means of the tensor calculus. Certain identities 
make their appearance, some of which bear a remarkable 
resemblance to the form of the equations which are satisfied 
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by the quantities introduced by the physicist in his discussion 
of material phenomena (i.e., in his study of the four-dimen- 
sional space-time of physical events), equations which the pure 
mathematician did not consider in the development of his 
geometry. Is this an accident, or are the relations of equality 
set up between the material tensors of the physicist and the 
geometrical tensors of the mathematician nothing more than 
identities, and have the physicist and mathematician been 
approaching reality from different aspects, and been talking 
about the same thing in different terms ? 

This is ‘“‘ Geometrisation of Physics’’ with a vengeance. 
It almost seems as if each individual will accept this position 
or not according to temperament. The reasons advanced for 
its acceptance seem to carry no weight in certain quarters.* 
Yet this book would be incomplete without some reference to 
this aspect of our subject. If the reader wishes to see it 
set forth with great vigour and clarity, he cannot do better 
than read Chapter XII. on “The Nature of Things” in 
Eddington’s book, “‘ Space, Time, and Gravitation,’ and also 
his papers referred to at the beginning of this chapter. For 
what immediately follows the writer is entirely indebted to 
these publications whose author possesses a gift of picturesque 
writing as powerful as his insight into the meaning of geo- 
metrical processes. 

We have in a previous part of this chapter completed the 
development of a pure geometry of a very general kind. This 
is one of the points of vantage from which we can start in our 
attack on the problem of Nature. The other starting-point is 
our inductive study of observational science. 

We then proceed to the setting up of a physical theory 
which is based upon zdentification of the geometrical functions 
with quantities obtained by experimental measurement. 

Where can we find the first point of contact between our 
two avenues of approach? Following Weyl, we have to admit 
that a gauge-system is as arbitrary as a co-ordinate system, so 
far as the geometry is concerned; but on the observational 
side we have good reason to believe that there is a natural 
gauge-system, otherwise our attempts to identify lengths on 
distant objects like the sun and planets with lengths on the 
earth would lead to discordant and paradoxical results. Is it, 
perchance, in the successful identification of this natural 
gauge-system that we can begin the series of identifications 


* See, for example, a paper by More in the “ Phil. Mag.,”’ Nov. 1921. 
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which, according to the view expressed above, constitutes 
physical theory ? Actual physical events have probably no 
regular relation connecting them at all, none having any element 
of simplicity at all events, but on averaging over a group of 
contiguous events it may be possible that we can verify the 
existence, for the average components of displacement from 
one event to another, of a quadratic function, invariant for 
change of co-ordinates and change of gauges. This will 
necessitate the discovery of a covariant in-tensor of the second 
order by a method which does not involve the introduction of a 
metric to begin with, Now we do employ such a method in 
setting up the idea of parallelism or correspondence between 
displacements at different world-points, i.e. by introducing the 
coefficients fay”. By an obvious adaptation of the previous 
analysis we prove that 


$* RapytA as SBy 


is the vector difference between the displacement A* before and 
after a parallel transference round a small closed circuit 6S47, 
where 


*Rawk = Ofrv/X pe — Paul /d%y + favfap™ — frufar". (24) 


Thus we establish the existence of the mixed in-tensor 
*Rauw*. A contraction establishes the existence of an in-tensor 
of the second order *G),, and this can be resolved into a 
symmetric tensor J,, and an anti-symmetric tensor Ky,. Obvi- 
ously 

*GapA*AB , : : - (25) 


is an in-invariant. On account of the anti-symmetry of Ky, it 
drops out in the summation, and 


TegasAe |, ees 


is also an in-invariant expression related to two contiguous 
events. In the four-dimensional space of the mathematician 
the element of distance between two contiguous points is given 
by 
apox,dxXp : : ~ (27) 
where g,yy is a symmetric tensor. 
Let us now identify the mathematician’s four-dimensional 


continuum with the physicist’s space-time. This means that the 
Ty, of (26) must be identical with the g,, of (27), ie., 


LZ we rt28y 
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Obviously (28) could not possibly remain true for any 
arbitrary change of gauges if true for an assigned gauge-system, 
which means that (28) defines a natural gauge-system. As 
Eddington says: ‘‘ The natural gauge of the (physical) world 
is determined by measures of space and time made with 
material and optical appliances. Any apparatus used to 
measure the world is itself part of the world, so that a natural 
gauge represents the world as self-gauging.”’ 

The introduction of a metric leads, of course, to the develop- 
ment of the Christoffel indices, the ordinary curvature tensor, 
and the tensors Gy, Uay, and Hy, and the proof that Kyy is 
the curl of a vector. 

Of course, (28) not only fixes a gauge-system, but also a 
unit of length. If we wish still to make use of the centimetre 
as a convenient unit, we replace (28) by 


| yeas ie : : 20) 
where A is some universal constant. 

As we proceed in our series of identifications we shall see 
that I, differs from G,, by a tensor which will be identified with 
a tensor representing electromagnetic and electronic effects. 
So that the “ emptier ”’ the space the closer will J,, approximate 
to°Ga,. Hence in the “void ” 


Guy =DZyy. - : ‘ <- (39) 
So Einstein’s law of gravitation proves to be a “ gauging 
equation.”’ 
The ten equations (30) agree with the view that the world is 
spherical. We know that twenty conditions must be satisfied 
by a truly spherical world, viz., 


Rauyx/| KA, uv | = constant. 


(See equation (16), Chapter XIV.) Nevertheless, a world 
conditioned by 
Gu = ASwy 


will possess the more important spherical properties. In fact, 
it is we who give it this symmetrical form by our choice of the 
natural gauge. We are aware of the difficulty involved in 
transporting gauges. When we say that the lengths of two 
material objects at different places are equal, we mean that the 
length of one bears to a standard length at the same place and 
in the same direction, the same ratio as the length of the other 
bears to a standard length at the latter place and in its direction. 
Our gauging equation simply implies that we have chosen the 
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radius of curvature, of the finite world at each place and in 
each direction as the standard for any linear measure at that 
place and in that direction, and we have assigned for con- 
venience a unit which makes this standard equal to ,/(3/A) cm. 
There is no such thing as an absolute size. “ Any material 
object of specified constitution must have determined size and 
shape in relation to the radii of the world. And in so far as the 
measure of length rests on such bodies, the form of the world 
expressed in this measure is bound to be quasi-spherical.” 
(Eddington.) For an amplification of the argument, see in 
particular the paper in the ‘‘ Phil. Mag.,’’ November, 192I. 

As relativists we must, of course, see that the mathematical 
expression of this choice is a covariant equation. We have 
written down 


Gus = ALuy 


as the mathematical expression, and it certainly is covariant. 
That it agrees with the condition that the curvature at all 
points, and in all directions, is uniform can be inferred from 
the arguments of Chapter XV. It can be shown directly by 
reference to the concluding paragraphs of Chapter XIV. Thus 
representing a four-dimensional manifold with general Rieman- 
nian curvature as a hypersurface in Euclidean space of higher 
dimensions, and using the five dimensions given by the tangent 
plane hypersurface and the normal, we have for the equation of 
the four-dimensional continuum near the origin 


22 == ByX 1" Rave" > Reka? hae 


By the methods indicated in Chapter XIV. we obtain, at 
the origin (which is entirely arbitrary), 


Gi ok ky (ke =f ks of ky), 


similar results for Ggs, G33, G44, and zero values for G,, where 
w+ v. The condition Gy = Ag, at the origin becomes 


Gi as Goo = Gs cam Gay ae r, 


because at the origin 21; = £22 = 833 = 4a = I, and gy, = 0 = 
etc. 
Hence 


ky=kg =k, = hy = a/(A/3); 
which proves the statement.* 
* In a paper contributed to the ‘“‘ Phil. Mag.” Jan., 1922, Eddington 


points out that this proof is of too limited a character, and supplies one 
which takes account of the ten dimensions which must be possessed by 


THE METHOD OF IDENTIFICATION 367 


Of course, equation (30) is only true in empty space. There 
is intense local curvature in the regions occupied by electrons, 
and equation (29) is then true, there being under these circum- 
stances large differences between J, and eye 

We now proceed further with our scheme of identification. 
We have identified physical space-time with a Riemannian 
four-dimensional continuum, and we need trouble no further 
about gauges, as we have assigned a gauge-system ; so we can 
be quite content if our tensors possess the ordinary properties 
of covariance for a change of co-ordinates alone. There is no 
need to restrict ourselves to in-tensors. Let us now identify 
“ things.’”’ What geometrical tensors possess the properties of 
physical quantities by virtue of mathematical identities. Review- 
ing the content of physical science, we note three sorts of 
attributes to things which require identification : 


(1) Energy, momentum, stress. 
(2) Electromagnetic force-tensor. 
(3) Electric charge and current. 


(1) The energy tensor 7,” must satisfy the equation of 
conservation 


(TA)e= 0. 


Among our geometrical tensors we find one which satisfies 
a similar equation, viz., 


Gy’ a 48u(G — 4a), 
where @ is any constant, for we know that 
(Guta = 3(8u(G — 4))a 
is an identity. 
So we write in the first instance 
td == R[G,” aa £2u"(G ay a)], 


where k is a constant depending on units. The constant a is, 
of course, not arbitrary ; it is readily determined, for in empty 
space JT,” is to vanish, and so in empty space we have by con- 
traction 


—G+2a=0. 
Hence in empty space * 
w! = FR". 


the higher Euclidean space in order to give full scope to the “ vagaries 
of which a four-dimensional surface is capable when it has six extra 


_ dimensions to twist about in.” 
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Thus 
a= 2! 
and so 
Ty’ = RG — $8y?(G — 2d)}. : - GF 


(2) The electromagnetic force-tensor F,, must satisfy 
Fuy/ dX + WP yal/dXp —— Fay! Xy = O, 


which is the tensor expressions of two of Maxwell’s equations. 
The Curl of any covariant vector would serve our purpose here 
as far as the mathematical identity is concerned; but the 
appearance of the fundamental in-tensor K,, in the analvsis of 
our curvature in-tensor into symmetric and anti-symmetric 
parts leads us naturally to the identification of Fy, with Kyy, 
or of the electromagnetic potential vector with B, (or ¢, in 
Weyl’s more restricted geometry). We might, of course, 
identify the electromagnetic potential with C, and F,, with the 
Curl of C, ; but this seems unnatural, in view of the actual 
appearance of Ky, or Curl B, in the analysis of curvature. 


So we write 
Fuy == [Kay . . . . (32) 


where f is a constant depending on units. 
(3) The current-charge vector J# must satisfy the con- 
ditions of conservation of charge, 1.e., 


(J*)a = 0. 
Now if A” is any anti-symmetric covariant tensor, then 


((A2*)a)q = 0 


is an identity. Thus we might identify J# with (Av8)g. We 
naturally seek for some important anti-symmetric tensor for 
Avy, and Kr rises once more in our minds. This suggestion 
receives confirmation in the fact that if we identify the physical 
vector J# with the geometrical vector (K#*),, we obtain from 
(2) at once the equations 


Jo = (Ft), 


which are the remaining pair of Maxwell’s equations. 

We can now consider the grounds on which Eddington 
regards Weyl’s geometry as too limited. 

In physical space-time the world-lines of all light pulses 
emitted from a point in space at a given instant, and in all 
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directions, form a conical locus whose equation near the 
corresponding world-point is 

bs =0 
or £apoX,o%g = O. 


Now this should be a unique locus in space-time at this 
point, and Weyl argues that this requires that a vector of zero 
magnitude should remain a vector of zero magnitude after 
transference round any closed circuit back to the same point. 
This condition is certainly satisfied in his theory, for by (12) 


Al? = Pb, g8S28 


so that if J? is zero, A/? is also zero. 

This condition is not of necessity satisfied by Eddington’s 
theory, because by a glance at the analysis immediately suc- 
ceeding equation (12), 

AP = KyapeAvA dS, 


and if we do not impose Weyl’s form (13) on Kay, A? need 
not be zero, even if A* is a vector of zero magnitude.t Yet 
Eddington argues that the condition that the light pulse locus 
should be unique can be satisfied in a more general way than 
that imposed by Weyl. If the locus exists independent of 
co-ordinates and gauges, we should seek for its analogue in 
four-dimensional geometry in some in-invariant equation 
defining a unique conical locus at a definite point in the four- 
dimensional continuum. This equation can only be 


Pgd%,5%8p = 0, 


where P,y is some in-tensor. There is only one possibility ; 
Pw is *G,y. Hence the geometrical locus is 


*(4p0%X,0%B = 0, 
or, since the anti-symmetric terms drop out, the locus is 
Tp8%_%p = O. 


But since in our physical space-time the gauge has been 
chosen to satisfy 
Tuy = Sw 
the process of identification gives us . 
£ap0% 0% = O 


+ The reader should, of course, require no reminder that a zero 
. magnitude is quite consistent with finite values for the components. 


24 
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as the equation of the light pulse locus, and the necessities of 
physical science are satisfied. ; 
Reflecting on the contents of the immediately preceding 
pages, we cannot fail to see that Eddington’s wider geometry is 
consistent with a simpler physical theory than Weyl’s. If we 
adopt the view that physical theory must be expressed by means 
of equations which are to remain invariant for both gauge and 
co-ordinate transformation, then these equations must employ 
in-tensors and not merely tensors, such as Gyy, which are altered 
by a change of gauges. (G,» would, of course, be unaltered for 
a limited group of gauge transformations—those in which the 
w function of equation (16) is a constant—but not for trans- 
formations of a perfectly arbitrary nature.) This means that 
Einstein’s theory would be approximate and require generalisa- 
tion, except in regions where there exists no electromagnetic 
field. Now there is nothing to be gained by complicating our 
physical theory if it can be avoided, and Eddington has shown 
that it is unnecessary. The parallel displacement of a vector 
is a process which may yield a change in the magnitude of the 
vector or not, just according to the postulates on which we 
decide to base our geometrical reasoning. In any event, we 
arrive at a tensor *G,, which provides us with a umaque 
theoretical gauge. Without necessarily abandoning Weyl’s 
geometrical postulate, we, nevertheless, contrive to avoid the 
awkward results in our physical theory which, at the first glance, 
it would seem to entail; for we can assume as a physical fact 
(until experiment contradicts it) that any material standard, 
in being transferred to a new position or rotated into a new 
direction, adjusts itself so as to be a constant fraction of the 
radius of curvature of the world at that place and in that 
direction. This assumption replaces in our physical theory the 
former assumption that in some manner the presence of an 
electromagnetic field involves a change in the size of a material 
body if it be moved about, even if it be returned to the place 
whence it started. It is clear that in view of the possibilities 
raised by our enlarged ideas concerning geometry, some physical 
postulate is required ; and one which permits a material standard 
to retain an unambiguous size at a defined place and in a defined 
direction would, even before investigation of its consequences, 
appear to possess a decided advantage. It allows us to treat 
the magnitude of a body as something determined by its 
present situation and not modified by the persistence in it of 
the effects of its previous history. This postulate allows us 
also to retain Einstein’s gravitational equations as exact state- 
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ments, for the physical theory has no concern with ambiguities 
of gauging; a gauge has been adopted for each place and 
direction, and Einstein’s hypothesis of a unique and invariant 
separation between two assigned events is preserved. We have 
no need to enlarge our Relativity standpoint beyond that 
required by Einstein’s theory. That theory was made possible 
by the abrogation of any privileged system of co-ordinates. 
On the other hand, we find it quite unnecessary to modify 
physical theory still further in order to suit the abandonment 
of a privileged gauge ; on the contrary, we retain such a gauge 
in order to retain Einstein’s theory. The purpose which Weyl 
had in view in adopting his original physical assumption was 
the derivation from the metric of a tensor whose nature rendered 
it suitable for identification with the electromagnetic field 
tensor, and the consequent destruction of the privilege hitherto 
accorded to gravitation in the Relativity theory. But that 
purpose is still achieved by Eddington’s assumption of the 
natural gauge. The in-tensor *G,, yields the metric or gravi- 
tational tensor Jy, or guy, and also K,y,, which has the properties 
just required for a tensor which satisfies Maxwell’s equations. 
More than that, it gives yet another tensor H,,, which, although 
not identified so far with any physical tensor, might prove 
useful in enlarging the scope of our physical theory if some one 
should by some ingenious suggestion use the method of identi- 
fication not merely to recognise old friends, but to discover 
new acquaintances. 

Anyone who has not a wide acquaintance with Riemannian 
geometry might reasonably ask why the physicist confines the 
identifying process to such a small number of geometric tensors. 
No doubt the consideration of curvature in its Riemannian 
sense has given rise to tensors satisfying mathematical identities 
required for the laws of gravitation and mechanics ; and the 
generalisation of the treatment of curvature by Weyl and 
Eddington has provided us with the tensors required for the 
expression of the laws of electromagnetism. But after all, 
might there not exist other tensors having the required mathe- 
matical properties, and, if so, why should they not be employed 
in the geometrisation of physics? The answer is that the 
available tensors appear to be remarkably few. It would, 
for instance, be conceivable that the fundamental invariant 
interval is the fourth root of some in-invariant expression 
which is a quartic function of the co-ordinate differences. E.g., 


*Ro pn’ hye aABAYAE 
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might be associated with a displacement A*. Such a complica- 
tion, however, would appear to be purely gratuitous, involving 
no clear gain on the physical side. There is just the one 
in-invariant of a simple character available, viz., 


*G,gA%AB. 


We can construct a number of “ densities’ whose weights 
are zero, and which, when multiplied by the element of four- 
dimensional volume, are invariants; e.g., 


q*Gap*G®, g*Rapy* RY, QKapK®, q(*G)?. 


Invariant densities of weight zero based on a sextic funda- 
mental tensor are known. 

The number of distinct characteristics of the world expres- 
sible by these absolute invariants, which are independent of 
co-ordinate and gauge-system, are remarkably few. There is 
one striking fact, first pointed out by Weyl. Only in a con- 
tinuum of even dimensions can we have absolute invariant 
densities. The factor g or ,/ (— g) has a weight /2 in a con- 
tinuum of ~ dimensions. If ” were odd, this weight would 
involve the fraction 4, which could not be counteracted by the 
necessarily integral weight of any tensor. 

If we decide to develop our geometry in a no more com- 
plicated fashion than is required by the physical facts, there 
are only available the very tensors which have been used. 

The physical science of to-day is confronted with two diffi- 
culties, which may possibly turn out to be two aspects of the 
one problem. One has already been referred to; it is the 
impossibility of deducing the existence of the forces which hold 
the parts of an electron together from the Maxwell equations, 
however developed and amplified. The other concerns the 
fact, experimentally verified over a wide range of phenomena, 
that there exists an undoubted atomicity in the quantity 
which we call “action.” The various formule involving 
Planck’s quantum constant h, whose validity has been firmly 
established by numerous experimental researches in radiation, 
photo-electricity, atomic heats, X-rays, can all be deduced 
from a proposition of statistical mechanics ; but that proposi- 
tion itself cannot be deduced from the application of the 
classical principles of dynamics and the theory of probability. 
Now, as these principles have given way to the wider dynamics 
of Relativity, it is not unnatural to entertain the hope that in 
the development of the Relativity theory we might find some 
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justification for the introduction of a natural unit of action. 
Unfortunately, neither difficulty shows any sign of being 
resolved at the moment ; but a few words may be written as 
a conclusion to this volume, concerning the suggestions which 
have been made as to possible modes of attack. 

In the first place, if we regard electrons and protons as 
structures made up of parts, it is a ‘‘ miracle,’ to quote 
Eddington’s picturesque phrase, for an electron or proton to 
exist at all. We need some kind of external field (such as the 
Poincaré pressure) to keep it from “‘ exploding,” yet we cannot 
find any evidence for the existence of such a field in our equa- 
tions. We can, of course, introduce this field as an empirical 
principle into our body of equations, just as Planck introduced 
the hypothesis of finite regions of equal probability into 
statistical mechanics to get round the experimental discrepan- 
cies which arose in radiation, specific heat, etc. But we are 
not happy with these strange neighbours. The “ miracle ’”’ is 
very obvious in the case of an electron at rest ; but, of course, 
if Relativity is true, it is equally a miracle for an electron in 
motion. But apart from this broad application of the principle, 
the miraculousness for the moving electron can be established 
by an analysis which is instructive. Thus in an impressed 
external field an electron will move with an accelerated motion, 
and will itself produce a field of force. It is conceivable that 
this field, due to the electron’s motion, might be such as not 
only to equilibrate within the electron the external field pro- 
ducing the motion, but, in addition, to supply a component 
on each part of the electronic structure sufficient to equilibrate 
the disintegrating forces of the electron as a whole on this 
part. That this is not so can be established in detail as follows. 

For a slowly moving element de of charge, the electromag- 
netic potential at a point distant 7 from the element is 


v,0e/7, vyde/7, v,de/7, Se/7, 


provided 7 is so small that we need not concern ourselves about 
retarded values. To generalise this to a form compatible with 
Relativity is not difficult. The components are 


(de/7) V+, 


where V# is the vector %, or d%,/ds and (6e/7)) is the value as 
calculated for Galilean axes in which the electron is momen- 
tarily at rest, so that we regard (5e/7)y as an invariant. With 
this understanding we can drop the zero suffix. Considering 
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that all parts of the electron have the same velocity or V*, 
the potential due to the electron is 


Ve/delr, 


integrating throughout the electron. But this is a contra- 
variant vector, and we must employ a covariant vector in 
determining the field due to the electron’s motion. This is 


A, where 
Ay = — &uaV%/delr 
= — V,fdelr. 


The minus sign is necessary, since in Galilean axes 


ii ie O53 en ae 
The force tensor due to the electron is thus 
(Ay)u — (Au)y 


= [(Vu)y — (V,)u] /de/ Y. 


Multiplying the value of this at each point of the electron 
by the element of the electron’s charge there, we find the force 
on the electron due to its own movement is 


[(Viu)y — (Vi)ul// deydes/r12 
= [(Vu), — (V>)ule/a, 


where I/a is an average value of I/7,, over every pair of points 
in the electron. 

Now if the external field tensor is Fy, the covariant force 
vector on the electron is F,,J* where 


je = eve. 
The actual equation of motion is, of course, given by 
Mo(Xu + {aB, w}Xaxe) = -- ghePap JP, 


where m, is the proper inertial mass of the electron. (See 
Chapter X., page 223.) 
But 


Xu + {aB, p}r%orte = V2(V#)q. 
Hence 


be, ]* = — m VV)... 


1 Sire Ie = Bee MV ACs u)a 
= — MV U(Vu)a — (Va)ul 


Therefore 
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because 
ee) =70;% 
As Jt = eVe, 
it follows that 
CF wy = — mel (Viu)y — (Vo)ul- 


In consequence, if 
m, = e/a, 


the external force on the electron eF,, is just equal and opposite 
to the force on the electron due to the motion produced by the 
external field. The electron’s motion just neutralises the 
external field in the electron itself. There remains nothing 
left to equilibrate the force on the electron due to the mutual 
repulsion of its parts. 

We cannot find within the older body of theory any hope of 
deducing the existence of the cohesive forces. Added to this, 
there is another feature to which reference has been previously 
made. The invariant T of the matter tensor Ty,” is certainly 
not zero; it is the (averaged) proper density of the matter. 
But the invariant E of the electromagnetic energy tensor E,” 
is zero, showing that it is impossible to build up electrons, and 
therefore matter, from electromagnetic fields alone. Some 
other form of energy must be present, the energy of the non- 
Maxwellian forces or the Poincaré pressure. Is there any hope 
of finding its formal expression by applying the identification 
method to some of the tensors obtained in the previous parts 
of this chapter, which have not had some physical counterpart 
assigned to them? The work of Weyl and Eddington provides 
us with absolute tensor and invariant curvatures which, when 
analysed, fall into parts some of which can be identified with 
gravitational and electromagnetic tensors. There still remain 
geometrical tensors such as C, and H,, unidentified with any 
physical tensor. 

From (21) we have 


*Guy = Guy + Hy + Ky. 
Also, referring to (22), we see that 
F yy = UpePUyp* + 20 "Bo — [(C ura + (Bu)y + (By)u], (33) 
and by contraction of H“, we obtain 
H = UspyU%78 + 2B,C* — [2(B%)q + (C%)a]. + (34) 
VoV, =f. 


Hence Vi(V4), + V2(Va), = 0 
and it is easy to see that V.(V), = V2(Va)u- 
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it follows that 
T =h(4\—G). 
Now 
T = g*®I ag = g**Agap = 4A. 
Hence pry 
T=) gg 
Also 


Tv = R(G, —420(G — 4A) — 2,74] 
= biG — 4e.(G — 1) — 101 
= — AH — 4H 


Equation (36) thus gives us another expression for the 
complete matter tensor in terms of a geometric tensor, which 
is the remaining part of the absolute curvature tensor after 
removing the gravitational tensor and the electromagnetic 
tensor. Just as equation (31) gives the gravitational aspect 
of material energy, so equation (36) will give its electrical 
aspect, for it will be remembered that U,,”, the tensor from 
whose components Hy, is formed, is frau” — {Au, v}, and Weyl 
and Eddington’s theory claims this differs from zero only in 
regions where electrical energy is present. It is presumed that 
in empty space the curvature of the world, using an approxi- 
mately Galilean frame, is 10~5° (cm.)—?, and Ua,” is of this 
order of magnitude except in electric fields so intense as to 
alter the local curvature enormously. In electromagnetic 
fields external to the electron the first two terms in (33) and (34), 
being product terms, will contribute practically nothing to 
H,” and H. But when the electric field is very intense, as we 
may assume that it is within an electron, the product terms will 
be the preponderant part. Indeed, for an electron at rest we 
know by the identification of B, with the electromagnetic 
potential that (Bb), is zero. We have no physical analogue to 
fit to C), but it may reasonably vanish also in the static case 
(it does in Weyl’s limitation). Hence the mass of an isolated 
electron is, from the electrical aspect, represented in all proba- 
bility by the product terms in the expression (34) for H. In 
connection with this, it may be as well to remind the reader of 
the elementary point that in calculating the “ electromagnetic 
inertia’ of an electron in the usual manner, what we really do 
is to find a mathematical expression for the energy of the mag- 
netic field due to its motion. It is 


nera—1f(v)v? 


(36) 


THE ELECTRON 3977 


where ” is a small numerical factor depending on units, f(v) is 
a function of the velocity which approaches unity as v ap- 
proaches zero, and a is the radius of a sphere within which we 
suppose, the field not to exist or on the usual law of force we 
would obtain an infinite result. 

In short, we find in this way the added energy due to the 
electron’s motion, and derive the rest-mass by drawing an 
analogy between that and mechanical kinetic energy. But 
we do not in this way obtain directly the rest-energy of the 
electron. We have seen already in the previous chapter that 
ordinary electrical theory will not supply it all even from the 
hypothesis of a static surface charge of e on a sphere of radius a. 
None of the familiar concepts of the Maxwell theory will supply 
the necessary material for the problem. There remains the 
hope, as yet unattained, that from the tensor H,, we may 
derive the physical theory which will fill the gap and provide 
us with the conditions of stability for the electronic structure 
as well as with the missing energy. 

It might be thought that the tensors T,” and E,”, given by 


Ty” = aay kA,” ae 3 wT | 
Ee? = Fygk* — 38 PF apk 
= fP( Kuck? — $8" K apK*) 


could be brought into direct relation by means of the third 
order tensor Uy,” or fay” — {Au, v} (since Hy, and Ky, are both 
derived from U),” by defined processes), and in consequence a 
direct expression for the non-Maxwellian energy tensor T,” — E,,” 
obtained. However, the mathematical difficulties are con- 
siderable, and have not so far been surmounted. 

Eddington has arrived at an interesting conclusion con- 
cerning the natural units in which physical quantities are 
measured. This he does by applying the method of Hamil- 
tonian variation to the invariant integral 


Ji*Gap*G8tden, 


which is not altered by any change in the natural gauge, i.e. it 
is independent of the number chosen to represent A, and so is 
a pure number. (It is, of course, altered by any arbitrary 
change of gauges, in which the % function is not a constant.) 
The following is a modification of Eddington’s analysis, which 
dispenses with the variation process, and bases itself on a 
hypothesis which is admittedly very tentative, but yet has an 
element of plausibility to recommend it. 
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Consider what we may call a ‘“‘ world” gauge-co-ordinate 
system, i.e. one in which the unit gauge is the radius of the 
world, and in which observers would regard the local part of 
the co-ordinate mesh as equivalent to a Cartesian co-ordinate 
system in which two points on the same axis I cm. apart would 
have a 8x, equal to p~—1, where the world radius is p cms. ; 
briefly, the unit-mesh is estimated locally to have a width 
equal to the world radius. Now we have been identifying 
certain physical tensors with geometrical tensors which arise 
in the discussion of the curvature of the continuum which 
represents space-time. It is only natural to look for a relation 
between the units in which we measure physical quantities and 
the unit of curvature used in the measurement of the geometrical 
tensors ; or, what comes to the same thing, the unit of length 
used in the geometrical quantities. It is also plausible to 
regard the unit of a physical quantity which is directly con- 
nected with the length unit in which the world radius is unity 
as being a natural unit of the quantity. 

Now in the gauge-co-ordinate system referred to 


Tuy = Sy 


which in order to keep dimensions correct should really be 
written 


Iw = Zu X unit of curvature. 


From guy we construct as usual the indexes {Ay, v}, and thence 
the Riemannian curvature tensor Ra, and the Einstein tensor 
Gyy. A subtraction of G,, from Iyy gives Hy. It is, of course, 
impossible to equate the parts into which *Gy,, is resolved, viz., 
Ty, and Kyy, to each other anywhere, since they are tensors of 
different character, one being symmetric and one anti-sym- 
metric. But we are going to equate : 


— LpaG* + t¢,”TagG%8 
to 


KyaKee — }e,’Kag Ko 


in space unoccupied by electrons. There are no purely mathe- 
matical objections, since both expressions are mixed tensors ; 
but the reasons for the procedure are physical. Reference to 
the gauging equation and to the considerations introduced 
in Chapter XV., in the paragraphs immediately succeeding 
equation (9), shows that the first of these expressions is the 
matter-tensor expressed in some as yet unknown unit. The 
second is the electromagnetic energy-tensor in the same unit. 
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Physically, we know that these tensors are equal in an electro- 
magnetic field free of electrons. Hence, after an obvious modi- 
fication of the first expression, 


— [Gur — deu(G — 2)] = KyaKo* — YayrKapKo8 


in a purely electromagnetic field. 

Let us transform gauge and co-ordinates to a centimetre 
gauge and a mesh whose unit is estimated locally to be x centi- 
metre wide, so that indicating the transformed quantities by 
accented symbols, we have 


OX = p—1%,' = ABbx,' 


where A cm. ~? is the curvature of the world. Since /ag5x,5xg, 
and K.g6x,6%g are in-invariants, it follows that 


1 Ba) eat W lhned 
Kyy = IN Gre 


Moreover, as we have seen a little earlier, 
7 aes , 
Tuy’ = Suv 


and therefore guy = Zw’. 
(This could of course be directly deduced from the equation 


LaBoXgO%B = Agap’d%q'dxp’.) 
It follows at once that 
Guy = AT *Gyy' 5 Gu? = A~1G'p” 
ar Ne TIS, 

Hence in a purely electromagnetic field 

—A“1G' yr — Be'y(G’ — 29)] = KyaKrs — deur KanK®. 

But we know from the problem dealt with in Chapter XI. 
that the left-hand expression is equal to 

Saxv—1T'y» 

where 7,” is the matter-tensor expressed in such a unit that 
its mass component is T’,* grams per cubic centimetre. So if 
E,» is the electromagnetic energy-tensor expressed in the natural 


umt, our result shows that the matter-tensor in the electro- 
magnetic field when expressed in gram units is 


A(87rK) ~— Ey” ; 
or, in other words, the energy-density of the field is A(87«) ~ 14,4 
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grams per C.C., or c?A(8a«) — 1£,4 ergs per c.c. Now the natural 
unit electromagnetic field will be one in which twice £,4 is 
unity, i.e. one with an energy density of 


c®A/ 167K 


ergs perc.c. Taking A to be 10 ~ 5° this works out to be approxi- 
mately -0025 ergs perc.c. An electrostatic field with such an 
energy-density would have a field intensity about 4 in electro- 
static units, or 75 volts per cm. 

If we take the world radius, about 10% cm., as a natural 
unit of length, we have a natural unit of time, viz., 101°/3 
seconds the time it takes light to travel along it. Taking the 
radius of an electron as 6 X I0~1%cm., we can find the amount 
of energy within a volume of the natural unit field equal to the 
electron’s volume; it works out to 


BUS OR AUIOES., 


If we multiply this by the natural unit of time, we obtain 
approximately 


10 — 7 erg-sec. 


It is a dangerous thing to lay much stress on numerical 
coincidences, but it is striking how near this calculation brings 
us to Planck’s quantum of action. 

Glancing back, we see that 


eG = Tuy a= Kuy’ 
= A(Suv + Kyy). 


The result can be stated in general terms thus. Using a 
co-ordinate system whose mesh has a unit width equal to the 
gauge used, then the in-tensor *G,, is given by 


*Guy = A(Suv + Fy). 


Here gy, is the metric tensor whose value is obtained just 
as in Einstein’s theory. Fy, is the field-tensor in the natural 
unit introduced above (about } electrostatic unit for electric 
and } gauss for magnetic field). Also, A is the curvature of 
the world in terms of the gauge used. 

The fact that Eddington’s geometry is less restricted than 
Weyl’s can be seen very simply by observing that, while Weyl 
abandons the invariance of the expression for the line element, 
he retains the assumption that the ratio of the magnitudes of 
two line elements drawn from the same point is not altered by 
parallel displacement of the elements. Eddington introduces 
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no such postulate; he bases his geometrical theory entirely 
on the conception of parallel displacement and the analytical 
expression for it, 


SA == — fagtA xe. 


The forty quantities fi,” are, at the outset, undetermined 
functions of the co-ordinates x, He succeeds in arriving at 
a quantitative measure of anything to which the term “ struc- 
ture” can be applied, “ structure ’”’ being considered as quite 
distinct from “substance.” This measure is *G,,, and it 
resolves itself naturally into two parts, a symmetrical J,, and 
an anti-symmetrical Kyy. Itis at this point that physical theory 
makes its appearance. Einstein, for example, builds up 
separation, and hence space, time, geometry, gravitation, and 
mechanics from Jy, ; he derives Maxwell’s theory of electrical 
phenomena from K,y,. He does so by recognising the physical 
manifestations of these two geometrical tensors. Thus his 
work is not an approximation; on the contrary, as far as 
Eddington’s World geometry can lead to any conclusion, 
Einstein’s postulates and theory are exact for the natural 
geometry of the World, the significance of the word “ natural ”’ 
emerging in the recognition of Einstein’s law of gravitation as 
a definition of a physically convenient and unique gauge. 
Einstein’s work is entirely unaffected by the ambiguity of length 
comparisons which enters at the initial stages of World Geo- 
metry ; his separation is absolute, since it is an in-invariant, 
for the co-ordinate-gauge systems which he employs. 


ADDENDUM. 


There still remains, however, the master problem, viz., to 
discover the differential equations which determine the quan- 
tities fy.” ; just as the problem in the original Relativity Theory 
was to discover the differential equations satisfied by the four- 
teen coefficients, gu, and ¢, A paper* in which Einstein 
himself suggests a method for formulating such equations has 
just appeared (March, 1923), and a note giving an outline of 
this paper is added as the corrected proofs of these last pages 
are going to press. 


* « Sitz, d. Preuss. Akad.,” 1923, pp. 32-37. 
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The method is based on Hamilton’s Principle of varying an 
Action integral, which was studied in Chapter XII. We 
observed in that connection that such an integral must be an 
invariant in order to satisfy the Principle of Relativity, and so 
the integrand is a scalar-density. In order to avoid digression 
later, let us deal with one or two mathematical points in con- 
nection with scalar and tensor densities. We shall denote such 
quantities by heavy Clarendon type. 

First of all, if Aa, is any arbitrary covariant tensor, 


Aagd%adxp 


is invariant, and we can prove by the method adopted in 
Chapter VIII. in connection with the fundamental tensor of 
the original theory, that 

|A|?dw 


is an invariant, where |A| represents the determinant formed 
by the constituents of A,,. Thus |A}# is a scalar-density, and 
the product of any tensor by |A|# is a tensor-density. 
Further, owing to the invariance of Aagdxadxg, we can easily 
adapt the reasoning on page 359 of this chapter, to demon- 
strate that 
VAry/ dX — fur*Ara — frv*Apa 


is a covariant tensor of the third order. Hence it follows just 
as before that 


Pru’ — ¥A¥*(IA pal d%a + Ang! Kp — VArp/2%—q) . (37) 


is a mixed tensor of the third order, where A¥” is equal to the 
cofactor of A,» in |A| divided by |JA |). 

Now in covariant derivation we must, of course, employ 
Sau” Where we employed {Au, v} in the original theory, which 
was based on the invariance of the expression for the line- 
element as a fundamental postulate—a postulate now dropped 
in the initial stages of the theory. E.g., the covariant deriva- 
tive of a contravariant tensor Bax is now 


DBrr/dxXy + farr®B +- foytBrs, 


and we shall use the symbol (B+), to refer to it. 

Similarly, we can speak of the covariant derivative of a 
tensor-density, if we can by some form of differential operator 
obtain a tensor-density from a tensor-density. It is not difficult 
to discover such an operator. Thus let B be a contravariant 
tensor-density, then we shall prove that 
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(B%¥), = IBM/dxy + forBu + faKB — fratB (38) 


is a tensor-density. 
Putting Bw = |A|*B%, it is easily seen that 


(B™), = |A|4(Bax), + Brel frat — |A|#d|A|#/dx,]. 
Now a reference to pages 196, 197 of Chapter IX. will show 


that 
|A|—49|A|*/dx, = {va, a} (with A written for g), 


and it will be at once apparent from (37) that the coefficient 
of B* in the second term on the right-hand side is a co- 
variant vector. Hence this second term is a mixed tensor- 
density, and as the first term is also a tensor-density, the fact 
is established for (B4¥),. 

Having disposed of this question of covariant differentiation, 
we can follow Einstein’s proposed method for establishing 
fundamental equations in a new generalisation of the theory. 

The postulate of parallel displacement leads to the Riemann 
tensor 

Rayo = afar [0% — apk/d%y + fay*fau*® — faufar', 
and the contracted tensor 


Rau => Sfrat/IXu as faut /d%q + fra® fup* — fru*fap8, 


where we drop the asterisk, and write Ra, for *Gau. 
Now write 
Ruy = (Suv + Fur) 


where gy» is a symmetric tensor, F,, an anti-symmetric, and A 
the curvature of the universe expressed in terms of our usual 
standards of length, so that 


Luv — £(Ifuat/IXy _ Sfra%/ 2%) _— uv /%q + fuaP fp 
= jotta | (39) 
Muy = 3(Yfuat/d%y — ofrat/%y) 


Now let L be a scalar-density which is a function of the 
guy and Fy, and let us write 


dL/ deur eee AY 
IL/F wy = Fe | (40) 
If any arbitrary variations be imposed on the gy, and Fy, 

SL = Adgag + Fd Fag . i 5 (4x) 


and since 8L is a scalar-density, therefore A“”” and F are 
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contravariant tensor-densities, symmetric and anti-symmetric 
respectively. 
Now A” can be written in the form 


Al = (= |ANIA@ . . > aes 
where A,y is a suitably chosen symmetrical covariant tensor, 


and A” is the contravariant tensor obtained by dividing the 
cofactor of A, in |A| by |A|, so that 


Ag? = AygAre 
== Eat ev 
= 0 if p = v 
and Aa =A. 
Also let Fvy be an anti-symmetric covariant tensor such that 
FY = (— |A |)tFe”. : : - 43) 


Since F” is anti-symmetric we can establish, just as in 
page 199, that 
Py ae a 


is a covariant vector-density, or 
VEHA/dx_ = JH 
= (tay - (44) 
where J“ is a vector-density and /# a vector. 
Now consider L to be an action-density, or, in other words, 


Sida 


to be an action function, which (presuming a suitable form is 
proposed for L) will yield differential equations for the fy,” 
with a physical significance, when the process of variation is 
carried out for arbitrary variations of the fy,” and the df,y”/d%xr. 
Equations (41) and (39) enable us to express the variation in 
terms of the dfay” and the d(dfiyu”/dx,). By the usual device of 
partial integration and the rejection of a boundary integral (on 
account of the postulated vanishing of the variations of the 
parameters at this boundary) we can express the variation in 
terms of the dfry” alone. After some steps which are straight- 
forward though tedious, it can be established that 


8/Lidw = /dw3fsy*{(A®). — 44 F(A) a — $47( AB), 
+ (F6). — 34 8(F7), — $4.0 Fé2),}. 


* The minus sign under the root will be justified presently. 
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Putting the variation equal to zero, usin d . 
bering that : q , g (44), and remem 


(F47) dfs,£ = 0 
on account of the anti-symmetry of Fy, we obtain 
(AM), — 44,(A), — BAA), — FATE — $A ATA= 0. (45) 


Contracting for » and v, we obtain 


3(A%*), + 5J34=0. . : - (46) 
Combining (45) and (46), we finally arrive at 
(A#), + 4A,AS4 + 1A ywSA = 0. : 47) 


A glance at (40), (44), and (39) will show that the forty 
equations (47) are differential equations for the forty quantities 
fx’, involving the first and second differential coefficients with 
respect to the x, provided, of course, a functional form has been 
ascribed to L. A solution of these equations would then yield 
by means of (39) values for g,, and F’,,, and we would proceed 
as before to identify the invariant , 


£apd%,0%B 
as the square of the line element in the World, and the tensor 


Fy, as the covariant tensor of the electromagnetic field, the 


contravariant being fF’. 
The question now arises as to a suitable form for L. If, 


for instance, we put 
Ee 2), 

and therefore, by (6) of Chapter IX., 
Re (om AY 


and also Fr =o 
Je=05 
it can be easily derived from (46) and (47) that 
tru’ = {Au, v}. 
Equations (39) then become 
Aguy = Gur 


and we have the original gravitational equations in a complete 
vacuum. 


25 
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Guided by this result, Einstein now proposes to write 
L = 2(—|B))# . ; : . (48) 


Buy = Su + Ew 


and |B| is the determinant of the B,,-coefficients. 

This proposal would give explicit expressions for the Av” 
and J in terms of the g,, and F,y,, and would thus give definite 
form to the differential equations (47). The equations are very 
complicated, and no solution even for a single centre is attempted 
in the paper, but, if solved, they would presumably yield the 
material for constructing gravitation, electricity, and matter. It 
is a striking feature of this new proposal of Einstein’s that the 
two fields are ideally fused into one concept ; for, whereas in 
former treatments of the Action Principle, the fields, the matter, 
and the electricity were represented by separate and independent 
terms in the Action function, Einstein’s Action function cannot 
be so divided up. The mathematical treatment links all the 
physical processes up in a mathematical function depending on 
the one quantity, viz., curvature. 

Einstein points out one result which may prove to be of 
extreme importance if the new proposal be ultimately justified. 
To see how it arises we must carry the analysis a little further 
than equation (47). We can write this equation by means of 
(38) as 


VAM/Dxy + fayRAH + fad + (fva, a} — fra®)AM + LAATH 


where 


+ 34 4J* = 0, 
(where in {Aw, vy} we now understand that A is written instead 
: Seat 
VAMA/Ox, = — AMAHBOA pa/ dx, 
(see equation (3), p. 196), 
and ANH Ane Ave Ara: 


Hence writing 
xee == Ava fru® 
[Ag y| = Ayaf{rp, a} 
Ju = Anal 
we can obtain after a little manipulation 


VA ru/d%y — fryp — fara 
= An,({va, a} — fra”) + 4AryJu + 4Aw Ja. 
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Now by (37) 
{va, a} — frat 
is a covariant vector. Make for the present the tentative 
assumption that 
{va, a} — frat = af, 


where a is a constant to be determined. Then 


VA ru/ dX — fravm — fuvr 
= GAnuJy + fAn Ju + tA Jo. 
Hence writing three such equations by cyclic interchange 


of A, pw, and v, and subtracting one of them from the sum of 
the other two we have 


2[Au, v] — 2fraw = (§ — @)AruJy + @AwJn + aAn Jy 
Hence : 
2{Mu, v} — faut = (8 — a) Ard? + awn + adn’ Iu 
Putting » = v = a and contracting, 
2{ra, a} — 2frg? = (2 —a+ 4a + a)Jy. 
Hence our tentative assumption is justified if a is put equal 
to — 4, so that we obtain finally 
faut = Qu} — BAI + BAYT + BAI - (49) 
and 
fe=aohtif. . . «= (60) 
By (39) and (50) we see that 
OAP wy = df p/d%y — dJr/d%Xu. 
It follows that the vector potential of the field is 
Ju/0A, 


which shows that there must be charge and current at every 
part of the field, a result not contemplated in the earlier electro- 
magnetic theory. Of course, for ordinary values of the field- 
tensor it is apparent that the values of J, will be minutely 
small on account of the excessive smallness of A. Even for 
values of F,,, which are supposed to exist at the surface of the 
electron on the usual theory (of the order 10?® in electrostatic 
units, and therefore also in our fundamental field unit), we find 


them quite compatible with small values of J,. Of course, the 
abe 
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proper vector to be considered in discussing density of charge 
or current is J# and 


UIE VNC ie 


But even though the Av be large where the field is strong, 
it appears on consideration that the minuteness of A still keeps 
the J» of reasonable magnitude for relatively enormous values 
of F,y. The new theory, therefore, holds out hopes of con- 
structing matter from electricity without the trouble of the 
singularities involved in the earlier electron theory. The 
theory, in fact, points to a general diffusion of “ electricity ”’ 
throughout the field; but it will, to be successful, have to 
account for the existence of the magnitudes which we call 
electron charge, mass, and size. It is possible that insight into 
these problems will only come by absorbing the quantum hypo- 
thesis at some point into the general theory. 

One other point occurs to the writer as he brings this hurried 
sketch of Einstein’s latest formulation to a close. If a solution 
be obtained for equations (47) which is static, i.e. does not 
involve x,, and in which, by analogy with the static solution 
for a single centre treated in Chapter XIII., gu, and Pu, 
are zero where p = I, 2, or 3, it follows that the resulting field 
will be gravitational and magnetostatic, for Fos, F3,, Fy, repre- 
sent magnetic components and Fy4,4, Pg4, F3, electric. This 
would appear to point to the conclusion that matter is ulti- 
mately built up of magnetic charges, a view already hinted at 
in Weyl and Eddington’s writings.* 


In conclusion, the reader will observe that the Relativity 
Theory has in all its successive generalisations, from the form 
of the theory as first published in 1905, proceeded by the modi- 
fication of its mathematical method but not its point of view. In 
all cases the invariance of mathematical forms of the equations of 
Physics has been its unalterable feature. At first it involved 
as a subsidiary postulate the invariant value of the velocity of 
light, leading to the conclusion that this unique velocity is the 
limiting velocity for any moving element. In 1915 this is 
abandoned and replaced by the invariance of the line element 


*See, for example, page 211 of Eddington’s latest book, «The 
Mathematical Theory of Relativity.” 
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in curved space-time, coupled with the subsidiary postulate 
that the invariant value is zero for the element of a light- 
track. There still remained two difficulties—the apparent 
absoluteness of rotation, and the objection which carried con- 
siderable weight with some physicists that gravitation occupied 
a privileged position in the theory of 1915 as compared with 
electromagnetic force, which physically was equally important. 
The first was met by the hypothesis of a spherical or spherical- 
cylindrical world. Weyl made the first step towards removing 
the second, and Eddington, in completing that step, starts 
from the existence of parallel displacement and not the in- 
variance of a line element. Einstein now proposes a form for 
the new equations of “ gravitation-cum-electromagnetism ” 
required by this advance. It remains to be seen if some strik- 
ing experimental verification will lend that strong support 
which was not wanting in the case of the earlier forms of the 
theory, but which has hitherto been denied to the conclusions 
arrived at since Weyl introduced his idea of ambiguity in 


gauging. 


-— sapere wi 


ee ie: 
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